

Digitized by the Internet Archive 
in 2022 with funding from 
Kahle/Austin Foundation 


https://archive.org/details/statisticalmetho0000trum 


TEXT-BOOK SERIES
Edited by PAUL MONROE, Ph.D.

TEXT-BOOK IN THE HISTORY OF EDUCATION.
By PAUL MONROE, Ph.D., Professor of History of Education,
Teachers College, Columbia University.

SOURCE BOOK IN THE HISTORY OF EDUCATION.
FOR THE GREEK AND ROMAN PERIOD.
By PAUL MONROE, Ph.D.

PRINCIPLES OF SECONDARY EDUCATION.
By PAUL MONROE, Ph.D.

TEXT-BOOK IN THE PRINCIPLES OF EDUCATION.
By ERNEST R. HENDERSON, Ph.D., Professor of Education and
Philosophy, Adelphi College.

DEMOCRACY AND EDUCATION. AN INTRODUCTION TO THE
PHILOSOPHY OF EDUCATION.
By JOHN DEWEY, Ph.D., Professor of Philosophy, Columbia
University.

STATE AND COUNTY SCHOOL ADMINISTRATION.
SOURCE BOOK.
By ELLWOOD P. CUBBERLEY, Ph.D., Professor of Education,
Stanford University, and EDWARD C. ELLIOTT, Ph.D., Professor
of Education, University of Wisconsin.

STATE AND COUNTY EDUCATIONAL REORGANIZATION.
By ELLWOOD P. CUBBERLEY, Ph.D.

THE PRINCIPLES OF SCIENCE TEACHING.
By GEORGE R. TWISS, B.Sc., Professor of the Principles and
Practice of Education, Ohio State University.

THE PRUSSIAN ELEMENTARY SCHOOLS.
By THOMAS ALEXANDER, Ph.D., Professor of Elementary
Education, George Peabody College for Teachers.

HOW TO MEASURE IN EDUCATION.
By WILLIAM A. McCALL, Ph.D., Assistant Professor of
Education, Teachers College, Columbia University.

A HISTORY OF THE FAMILY AS A SOCIAL AND EDUCATIONAL
INSTITUTION.
By WILLYSTINE GOODSELL, Ph.D., Assistant Professor of
Education, Teachers College, Columbia University.

THE EDUCATION OF WOMEN.
By WILLYSTINE GOODSELL, Ph.D.

STATISTICAL METHOD.
By TRUMAN L. KELLEY, Ph.D., Professor of Education,
Stanford University.

A HISTORY OF EDUCATION IN THE UNITED STATES.
By PAUL MONROE, Ph.D. In preparation.


STATISTICAL METHOD

BY
TRUMAN L. KELLEY, Ph.D.
PROFESSOR OF EDUCATION IN STANFORD UNIVERSITY

New York
THE MACMILLAN COMPANY
1923

All Rights Reserved

PRINTED IN THE UNITED STATES OF AMERICA

COPYRIGHT, 1923,
By THE MACMILLAN COMPANY

Set up and electrotyped. Published, May, 1923.


PREFACE 


This book has been written with a view to serving two 
needs; that of biologists, economists, educators and psycholo- 
gists, who know little of higher mathematics, possibly care less, 
and who use statistical methods merely as a device to portray 
the facts of their group investigations; and that of those in the 
same fields who resort to mathematics to aid in the discovery 
of new truths. 

The elementary statistical needs in the four fields men- 
tioned seem to me to be the same and it is my aim to meet 
those needs and provide a foundation which will serve for ad- 
vanced work in any one of them. 

The approach to the essential principles developed is through 
concrete problems, only varying from this where simplicity of 
problems or the necessity for conserving space warrants. 

In order to provide a rigorous foundation for further statis- 
tical research — which would immediately take the economist, 
educator, or psychologist as well as the biologist into the fer- 
tile field developed by Karl Pearson and his co-workers — the 
notation follows that of the English school, making such sim- 
plifications as are possible for the immediate problems, but en- 
deavoring at no time to introduce a symbol, an approximation, 
or a lax proof which would have to be unlearned in undertak- 
ing more advanced work. The statistician cannot fail to note 
that the sheer visual weight of symbol, so appalling to the tyro, 
has been genuinely reduced by the introduction of a few new 
symbols in connection with multiple correlation. 

The field represented by various correlation and other
measures whose probable errors are unknown has been treated
very succinctly. I can see no value except at times a slightly 
greater ease of manipulation, in using a measure whose prob- 
able error cannot be calculated if one with a known probable
error and serving the same purpose exists. I have, therefore,
simply included and defined such measures for those desirous 
of using them, without deriving or attempting to justify them. 

I particularly request the critical analysis by fellow statisti- 
cians of my determinations of probable errors, and such char- 
ity in reporting shortcomings as may be due one who has 
acted upon the policy that as shrewd an estimate as possible 
of the probable error of a statistical constant is better than no 
estimate at all. The derivation of probable error formulas has 
been one of the most difficult undertakings of this text and I 
cannot expect that the results are faultless. 

My statistical training has been rather desultory and it has 
occasionally been impossible for me to give due credit to the 
discoverers of well known formulas. 

I would, however, say that my greatest inspiration has been 
the product of that master analyst, Karl Pearson, and that 
the English school entire has been most contributive. My 
greatest indebtedness to men in America is to my teachers, 
Henry Lewis Rietz and Charles C. Grove, for enlightenment 
upon theoretical points and to Edward L. Thorndike for sug- 
gestions as to problems in need of statistical analysis. 




CONTENTS

CHAPTER
I. THE TABULATION AND PLOTTING OF SERIES
    SECTION
    1. Introduction
    2. Statistical Series
    3. Construction of Statistical Tables

II. GRAPHIC METHODS
    4. The Histogram and Frequency Polygon
    5. The Time Chart; Relative Time Chart; Chart of Ratios
    6. Smoothing Data
    7. The Ogive Curve
    8. The Growth Curve
    9. The Graphic Representation of Categorical Measures

III. MEASUREMENT OF CENTRAL TENDENCIES
    10. Averages
    11. The Arithmetic Mean
    12. The Median
    13. Percentiles
    14. The Mode
    15. The Harmonic Mean
    16. The Geometric Mean
    17. Weighting

IV. MEASURES OF DISPERSION
    18. The Mean Deviation
    19. The Quartile Deviation
    20. The 10-90 Percentile Range
    21. The Standard Deviation
    22. The Standard Error of the Mean
    23. The Standard Error of Any Moment
    24. The Standard Error of a Class Frequency; of the
        Median; and of a Percentile


V. THE NORMAL PROBABILITY DISTRIBUTION
    25. Derivation of Equation of Normal Distribution
    26. Certain Properties of the Normal Distribution
    27. Kelley-Wood Table of the Normal Probability Integral
    28. Further Properties of the Normal Distribution
    29. Properties of Portions of a Normal Distribution
    30. The Probability of Exceeding a Given Divergence
    31. Summary of Facts Concerning the Normal Distribution

VI. COMPARABLE MEASURES
    32. The Conditions Requisite for Comparability
    33. The Ratio Method
    34. The Standard Measure Method
    35. The Equivalence of Successive Percentiles Method

VII. THE FITTING OF CURVES TO DISTRIBUTIONS
    36. Methods of Fitting Curves to Observations
    37. The Principle Underlying Pearson’s Method of
        Curve Fitting
    38. Description of Types of Curves
    39. The Fitting of the Most [illegible] Types of Curves
    40. The Bearing of [illegible] upon Stability of
        Distribution
    41. Illustrations of Unstable Distributions

VIII. MEASURES OF RELATIONSHIP
    42. The Problem of Concomitant Variation in
        [illegible] Sciences
    43. Findings Resulting from Galton’s Graphic Treatment
    44. Algebraic [illegible] of Galton’s Graphic Findings
        and Derivation of Correlation Formulas
    45. The Detailed Steps in the Calculation of Correlation
        and Regression Constants
    46. The Error Involved in Certain Approximations
    47. The Bearing of Broad Categories upon Correlation
    48. Properties of Correlation Surfaces
    49. Standard Deviations and Correlations of Various
        Constants


    50. Formulas for the Calculation of the Product-Moment
        Coefficient of Correlation
    51. The Interpretation of Regression Coefficients
    52. Product-Moment Correlation of Non-Rectilinear Data
    53. The Rank Method of Calculating Correlation

IX. FUNCTIONS INVOLVING CORRELATED MEASURES
    54. Correlations of [illegible]
    55. The Reliability Coefficient
    56. Correction for Attenuation
    57. Reliability of Averages
    58. The Probable Error of a Coefficient Corrected for
        Attenuation
    59. Estimates of True Scores and the Probable Errors
        of These Estimates
    60. Accuracy of Placement on [illegible] of a Single Score
    61. Average Intercorrelation
    62. The Effect of Different Ranges upon Correlation
        of Similar Measures
    63. The Effect of Different Ranges upon Correlation
        of Different Measures
    64. The Effect of Double Selection upon Correlation of
        Different Measures

X. FURTHER METHODS OF MEASURING RELATIONSHIP
    65. The Various Ways of Measuring Relationship
    66. The Median Ratio Correlation Coefficient
    67. Correlation Determined from a Curve of
        Correspondence by Rank
    68. Correlation Ratio Method
    69. Method of Parabolic Regression
    70. Bi-Serial r Method
    71. Bi-Serial Eta
    72. Tetrachoric Correlation
    73. Correlation in a Four-Fold Point Surface
    74. Measures of Correlation not Equivalent to the
        Product-Moment Coefficient; Yule’s Coefficients
        of Association and of Colligation

    75. Measures of Relationship Interpreted in Terms
        of Probability
    76. Equi-Probable r
    77. Mean Square Contingency and Coefficient of
        Contingency
    78. Variate Difference Method

XI. MULTIPLE CORRELATION
    79. The Problem
    80. Theoretical Treatment - 3 Variables
    81. Three-Variable Problem Illustrating Meanings
        of Constants
    82. The Use of the Alignment Chart
    83. The General Treatment of the m-Variable Problem
    84. The Method of Successive Approximations

XII. STATISTICAL TREATMENT OF SUNDRY SPECIAL PROBLEMS
    85. Statistical Constants Determined from Mutilated
        Distributions
    86. Correlation Determined from Mutilated Distributions
    87. The Probable Error of Percentage Measures of
        Overlapping
    88. A Criterion for the [illegible] of Elements
        Having Fixt [illegible]
    89. Trade Test Calibration
    90. The Determination of the Cross-over Value
        [illegible] a Chromosome Section
    91. The Best Weighted Average of Independent Variables
    92. Psychophysical Methods

XIII. INDEX NUMBERS
    93. The Bearing of Purpose and Material upon Form
        of Index
    94. The Meaning of a Price Ratio [illegible] Index
    95. The Probable Errors of Various Indexes
    96. The Accuracy and Flexibility of the Weighted
        Geometric Mean Index


    97. Criteria for Judging of the Excellence of Indexes
    98. The Use of Any Year as Base

APPENDIX
    A. [illegible]
    B. [illegible]
    C. KELLEY-WOOD TABLE OF THE NORMAL PROBABILITY
       INTEGRAL

INDEX

ALIGNMENT CHART (inside back end-paper)


STATISTICAL METHOD 


CHAPTER I 


THE TABULATION AND PLOTTING OF SERIES 


Section 1. INTRODUCTION 


Two occasions for resort to statistical procedure, the one 
dominated by a desire to prove a hypothesis, and the other by 
a desire to invent one, have led to two schools of statisticians. 

The first school is that represented by mathematicians who 
start with certain elementary principles and deduce therefrom 
facts of distribution, frequency and relationship. In so far as 
observed situations parallel these conclusions the same elementary
principles are supported as applying to the data in hand. One 
weakness of this approach lies in the fact that a number of 
causes — different sets of elementary principles — may result in 
substantially the same net result. A still greater weakness is 
that it is essentially a deductive procedure and relatively sterile 
in suggesting new causes — in inspiring creative inferences. It 
is fundamentally a method of proof and not one of invention; and 
just because it is a method of proof, it has a permanent place in 
statistical method. It must, however, if in the service of the 
social and biologic sciences, be but a handmaid to the creative 
genius of mathematical analysis and induction. 

The second school is best represented by those biometricians 
and economists who start with observed data and endeavor so 
to group them and treat them that the constant features of the 
data are made apparent. This is a process of statistical 
analysis. It may at times be expected to be an involved process, 
for social phenomena are complex. Data are frequently warped 
to fit statistical convenience, but if statistics is to realize its 
high destiny, procedure must be flexible, for only when the 
method is mobile can it fit immobile data. The accurate
measurement of those features of phenomena which are excep-
tional is the unique province of statistical analysis. 

The method of approach in this text is inductive, starting 
with data and deriving constants, and will not give the nou- 
menal satisfaction that comes from tossing coins, throwing 
dice, and sorting cards, thus obtaining distributions which 
approach an ideal standard. 

Mathematical statistics form very much of a unit, and it 
is impossible to treat fully of topics in an order which does 
not call in earlier chapters for concepts developed later. The 
genuine unity of statistics is made apparent by these inter- 
relationships, and I have not attempted to avoid them. Terms 
used in an earlier part of the text than that in which derived 
are usually unambiguous on account of the context, but should 
there be any difficulty in understanding, the reader is directed 
to the bold face references given in the index and to the list 
of mathematical terms and symbols given in the Appendix. 


Section 2. STATISTICAL SERIES 


The treatment of this and the succeeding section largely 
follows that of Day (1919 and 1920). 

A statistical series is a succession of facts having some 
common characteristic. A series may be thought of as either 
giving (1) a location in time, (2) a location in space, (3) an 
indication of qualitative difference, or (4) of quantitative 
differences. 

(1) Trends in prices, rates of growth, fatigue, learning and 
forgetting curves, diurnal changes, etc., are illustrations of 
the magnitude of a variable with reference to time. Temporal 
series have certain characteristics which necessitate a technique 
in their interpretation which is peculiar to them. Any time 
series of appreciable duration (in studying etheric vibrations 
.001 of a second would be a very appreciable duration) may be
expected to show periodic fluctuations. As a consequence one 
of two procedures is necessary, dependent upon whether (a) 
it is desired to study the changes within a certain cyclical 
period, or (b) to study trends independent of such periodic 
changes. Illustrations will make the problem clear: 




(a) Let it be required to ascertain the nature of the load of 
an electric power generating plant during a twenty-four-hour 
period. The current consumed per hour for some one day 
could be tabulated or plotted. The result would have only 
such accuracy as would result from a single day’s sampling. 
To obtain a more reliable picture, a number of days could be 
combined and the tabulation made showing the average load 
for each hour of the twenty-four. Obviously error might 
creep in here, for the load on a Monday would be quite different 
from that on a Saturday or Sunday and perhaps different from 
that on the other days of the week. With due allowance for 
holidays, probably a very satisfactory idea of the hourly 
fluctuations of the Monday load could be obtained by pooling 
results for several Mondays. Differences in daylight, tempera- 
ture, etc., would make it unsound to combine all the Mondays 
in the year. The problem cited is typical of temporal series 
problems and the principle that should guide one in pooling 
results should be to group as wide a range of data as are typical 
with respect to the characteristic under investigation, but not 
affected by other seasonal or systematic tendencies. 

(b) Let it be required to ascertain the nature of the seasonal 
fluctuations of the load. In this case a tabulation by weekly 
units would be the best as this would completely suppress both 
Saturday and Sunday and hourly idiosyncrasies. With this 
in mind it is seen that a tabulation by six or eight day or 
monthly periods would not be as satisfactory as weekly or 
bi-weekly periods. The principle to follow is to use such a 
temporal unit as equals or is an integral multiple of the period 
within which occur the tendencies which it is desired to 
suppress. 
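
The principle just stated can be checked on invented figures. The sketch below assumes a hypothetical daily load made up of a steady trend plus a weekly pattern with lighter Saturdays and Sundays; the function names and all numbers are illustrative and are not taken from any actual plant. It compares a tabulation by seven-day units with one by six-day units.

```python
# Hypothetical daily loads: a steady upward trend plus a weekly pattern
# in which Saturday and Sunday loads are lower. All numbers are invented.
WEEKDAY_EFFECT = [0, 0, 0, 0, 0, -30, -40]  # Monday .. Sunday

def daily_load(day):
    """Load on day number `day` (day 0 is a Monday)."""
    return 100 + day + WEEKDAY_EFFECT[day % 7]

def block_means(values, width):
    """Mean of each complete block of `width` consecutive values."""
    n_blocks = len(values) // width
    return [sum(values[i * width:(i + 1) * width]) / width
            for i in range(n_blocks)]

loads = [daily_load(d) for d in range(84)]  # twelve weeks

weekly = block_means(loads, 7)   # unit equal to the period to be suppressed
six_day = block_means(loads, 6)  # unit not a multiple of that period

# Successive differences show what survives in each tabulation.
weekly_steps = [b - a for a, b in zip(weekly, weekly[1:])]
six_day_steps = [b - a for a, b in zip(six_day, six_day[1:])]

print(weekly_steps)   # every step is the same: only the trend remains
print(six_day_steps)  # steps vary: the weekly pattern leaks through
```

With the seven-day unit every block contains each day of the week exactly once, so the Saturday and Sunday effect contributes the same amount to every block and drops out of the comparison between blocks.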

A second characteristic of a temporal series arises from the 
general lack of significance of the absolute value of a function 
at a given time. Interpretation depends upon the relation 
of the function at one time to its magnitude at a second time. 
This fact has led to the use of index numbers, or ratios of 
magnitudes. The magnitude at a stipulated time is considered 
basic and used as the denominator of all the ratios. The index 
number is not limited to temporal series, but it is more char- 
acteristic and more generally serviceable with them than with
other series. Many considerations enter into the choice of
the base, but if there is one time, such as a certain year, which 
more than any other shows a constant condition of the function, 
or an ideal or desirable condition, it will have special value as 
the base. 
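
The calculation of index numbers described above may be sketched as follows. The yearly magnitudes are invented, the function name is hypothetical, and the scaling of the base to 100 is a common convention assumed here rather than one stated in the text.

```python
def index_numbers(series, base):
    """Express each magnitude as a percentage of the magnitude at `base`."""
    denominator = series[base]
    return {time: 100 * value / denominator for time, value in series.items()}

# Invented yearly magnitudes of some function, e.g. a price.
prices = {1913: 40.0, 1914: 42.0, 1915: 50.0, 1916: 60.0}

on_1913 = index_numbers(prices, base=1913)
on_1915 = index_numbers(prices, base=1915)

print(on_1913)
print(on_1915)
```

Changing the base rescales every index number but leaves the ratio of any two of them unchanged, which is why the choice of base affects interpretation rather than the relations shown.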

(2) Just as index and periodic concepts are fruitful in in- 
terpreting temporal series, so is the map essential in portraying 
spatial series. Many spatial series show both qualitative and 
quantitative differences, in which case considerable ingenuity 
is needed to devise a map with cross sectioning, or color scheme, 
to portray the essential facts. Spatial series are intrinsically 
more amenable to graphic treatment, and less to numerical 
treatment, than temporal or quantitative series. The maps of 
the U. S. Coast and Geodetic Survey, of the Weather Bureau, 
and of the Census Bureau show the completeness, variety and 
detail of portrayal possible. The groupings of territories in 
spatial series and the subdivision of areas may follow conven- 
tional procedure or the peculiar needs of the problem. The 
order adopted by the Census Bureau in giving population 
statistics is as follows: 


TABLE I

New England
    Maine
    New Hampshire
    Vermont
    Massachusetts
    Rhode Island
    Connecticut
Middle Atlantic
    New York
    New Jersey
    Pennsylvania
East North Central
    Ohio
    Indiana
    Illinois
    Michigan
    Wisconsin
West North Central
    Minnesota
    Iowa
    Missouri
    North Dakota
    South Dakota
    Nebraska
    Kansas
South Atlantic
    Delaware
    Maryland
    District of Columbia
    Virginia
    West Virginia
    North Carolina
    South Carolina
    Georgia
    Florida
East South Central
    Kentucky
    Tennessee
    Alabama
    Mississippi
West South Central
    Arkansas
    Louisiana
    Oklahoma
    Texas
Mountain
    Montana
    Idaho
    Wyoming
    Colorado
    New Mexico
    Arizona
    Utah
    Nevada
Pacific
    Washington
    Oregon
    California


(3) Qualitative series are those in which the classification 
is based upon the presence or absence of certain qualities. 
They lead to categorical distributions and are treated statisti- 
cally by means of the probabilities of frequencies, and by 
measures of relationship dependent upon the same — con- 
tingency coefficients, etc. The variability of a frequency is 
the basic concept in the statistics of qualitative series. 

(4) Quantitative series are those in which the classification is 
based upon the degree to which some measured trait is present. 
They are the most amenable to numerical treatment and their 
consideration comprises the bulk of this text. The variability 
of a distribution is the most basic concept in the statistics of 
quantitative series. 

Life’s problems do not confine themselves to single series, 
and certain methods have been developed for handling problems 
which are complexes of two or more of the four types men- 
tioned, but it is well to recognize that in general the problem 
and the method are functions of a single series. 


Section 3. CONSTRUCTION OF STATISTICAL TABLES 


The chapter which follows this deals with graphic methods 
and is concerned with charts, diagrams, graphs, etc., con- 
stituting pictorial representations of statistical series. The 
statistical table is quite different. Its purpose is not directly 
to give a picture of a sequence, but to provide the basic data 
from which such a picture, or at least the outstanding features 
of such a picture, may be determined and visualized if desired. 




The statistical table is simply a shorthand statement of facts. 
If a thousand or so facts of the sort, “The population of Aaber
County is 4000;” “The population of Anthony County is
3200;” “The population of Avery County is 4800;” etc.,
etc., are to be presented, they can not only be more concisely
shown by tabulation, but several thousand additional facts,
such as “The population of Aaber County is 800 larger than
that of Anthony County” are presented at the same time and in
an agreeably compact manner. The desire to accomplish 
double, triple, or manifold presentation by a single tabular 
arrangement is the desideratum which imposes conditions and 
determines appropriateness of procedure. 

The same facts in regard to population are shown in the 
following five tables, and while not exhausting the possibilities 
of presentation these will suffice to show the wide option which 
exists in presenting very simple data. 


TABLE II
Populations and Areas of Counties

COUNTIES     POPULATION 1920    AREA IN SQ. MILES
Aaber             4,000               480
Anthony           3,200               400
Avery             4,800               800
Bascomb          16,000               700
Brown             3,000               600

TABLE III
Areas and Populations of Counties

COUNTIES     AREA IN SQ. MILES    POPULATION 1920
Aaber              480                 4,000
Anthony            400                 3,200
Avery              800                 4,800
Bascomb            700                16,000
Brown              600                 3,000

TABLE IV
Counties arranged according to Population

COUNTIES     POPULATION 1920
Brown             3,000
Anthony           3,200
Aaber             4,000
Avery             4,800
Bascomb          16,000

TABLE V
Counties arranged according to Population

COUNTIES     POPULATION 1920
Bascomb          16,000
Avery             4,800
Aaber             4,000
Anthony           3,200
Brown             3,000

TABLE VI
Counties arranged according to Population

POPULATION 1920    COUNTIES
    16,000         Bascomb
     4,800         Avery
     4,000         Aaber
     3,200         Anthony
     3,000         Brown




As judged by a single purpose no two of the tables given 
are equally meritorious. If the table is to be used more 
frequently in abstracting information about various counties 
than as a means of comparing counties, i.e., if it is a reference 
table and not one pointing some conclusion, the items in the 
stub (the first column) should be arranged alphabetically as 
in Tables II and III in order to facilitate the finding of items 
desired. If populations are more likely to be studied than 
areas, Table II is preferable to Table III, as the Population 
column holds a dominant position in Table II. 

Should it be intended that the table be not primarily a refer- 
ence table arranged to simplify the extraction of items of in- 
formation, but, let us say, to point conclusions with reference to 
populations, Tables IV, V, or VI are preferable to Tables II 
or III. If counties of large population are the chief con- 
sideration, Table V is preferable to Table IV, as the first row of 
a table ranks higher in dominance than successive rows. Next 
in importance is the last row. Totals or averages are, because 
of their importance, frequently placed in the first row, but if 
other items demand this position or if captions (headings of 
columns) are less readily interpreted when separated from the 
body of the table by a row of totals or averages, then the 
bottom row may be used. 

As a means of pointing conclusions dependent upon popula- 
tions Table VI is to be preferred to Tables IV or V, as the popu- 
lation data hold the dominant position in Table VI. 

In general one should so draw up the table that the items in 
the stub and the captions constitute the argument or informa- 
tion with which the table is entered, and so that the column 
and row next to the stub and captions contain the most impor- 
tant items to be obtained from the table. Rows and columns 
more removed from these dominant positions should contain 
less important data, except that the last row and last column 
may be given to data of first or second importance. 

Such Tables as II and III are primary or general purpose 
tables, since they contain the raw data without abridgment, 
and may be used for various purposes. Such Tables as IV, V, 
and VI are derived from primary tables, such as II and III, 
and by emphasizing certain facts serve a special purpose. 




These two types of tables should be recognized. The special 
purpose table is always published because it conveys the point 
of the study. The general purpose table should always be 
published also, as it provides the only means of checking the 
author and of discovering if other or further conclusions can 
be drawn. Several tables and many calculations may be in- 
volved between the primary and the final derived table. If 
full description of these intermediate steps be given it is not 
essential that these intervening tables and calculations be 
published. 
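
The derivation of special purpose tables from a primary table may be sketched as follows, using the county figures of Tables II to VI. The variable names are illustrative only, and the sketch shows one way of producing the arrangements discussed in this section, not a prescribed procedure.

```python
# Primary (general purpose) data as in Tables II and III:
# (county, population 1920, area in sq. miles)
primary = [
    ("Aaber", 4000, 480),
    ("Anthony", 3200, 400),
    ("Avery", 4800, 800),
    ("Bascomb", 16000, 700),
    ("Brown", 3000, 600),
]

# Table II: reference table, stub arranged alphabetically.
table_ii = sorted(primary, key=lambda row: row[0])

# Table IV: counties in ascending order of population.
table_iv = [(county, pop) for county, pop, _ in
            sorted(primary, key=lambda row: row[1])]

# Table V: descending order, so the largest county holds the first row.
table_v = list(reversed(table_iv))

# Table VI: same order as Table V, with population in the first column.
table_vi = [(pop, county) for county, pop in table_v]

for pop, county in table_vi:
    print(f"{pop:>6,}  {county}")
```

Every derived table here is obtained from the primary rows by selection and reordering alone, which is why publication of the primary table allows a reader to check the derived ones.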


CHAPTER II 


GRAPHIC METHODS 


Section 4. THE HISTOGRAM AND FREQUENCY POLYGON

The picturing of facts, when the nature of the data permits, 
conveys a readier comprehension than is possible from any 
array of figures. The accurate graphic portrayal of data is 
therefore the problem of this chapter. 

Since there are but two dimensions to the surface of a sheet 
of paper, ordinarily but two series of facts are shown in a single 
graph. Consider the accompanying data giving the maximum 
temperatures recorded by the Weather Bureau for each day 
in July and August, 1917, for New York City. 


TABLE VII

Maximum Temperature for Each Day
July 1-Aug. 31, 1917
N. Y. City

July  1   80     July 17   [?]    Aug.  1   98     Aug. 17   85
      2   88          18   80           2   96          18   80
      3   [?]         19   77           3   83          19   81
      4   78          20   83           4   80          20   84
      5   81          21   [?]          5   82          21   85
      6   80          22   86           6   82          22   [?]
      7   [?]         23   86           7   88          23   [?]
      8   [?]         24   86           8   78          24   83
      9   75          25   84           9   83          25   82
     10   65          26   85          10   80          26   [?]
     11   66          27   [?]         11   [?]         27   [?]
     12   [?]         28   80          12   [?]         28   80
     13   [?]         29   [?]         13   [?]         29   83
     14   81          30   95          14   78          30   81
     15   75          31   98          15   81          31   [?]
     16   85                           16   80



TABLE VIII
Tally Sheet

[Tally sheet not reproduced. Columns: Temperatures (65 to 98);
No. of Days with Given Temperature, recorded as tally marks.]


If it is desired to study diurnal changes in maximum tempera- 
tures a graph could be made in which the abscissa (the hori- 
zontal dimension) represents the days in order, July 1, July 2, 
etc., and the ordinate (the vertical dimension) represents the 
temperatures in order, 0°, 1°, 2°, etc. For July 1 the ordinate 
would be 80, for July 2, 88, etc. A line connecting the suc- 
cessive ordinates would give a picture of the changes in maxi- 
mum temperature throughout the two months. Or, it may be 
desired to disregard the sequence of the days and obtain a 
general idea of what constitutes the maximum temperatures 
for days in New York during July and August. In this case 
the abscissa will represent temperatures and the ordinate the 
number of days. To do this, Table VIII is first made out 
from the data in Table VII and then plotted as shown in 
Charts I or II.

Chart I is a histogram or a pictorial representation by means 
of rectangles, telling precisely the same story as a table of 
frequencies, such as Table VIII. Chart II is a frequency 
polygon. It is not a series of discrete elements as are the raw, 
gross, or original, measures, but a closed figure, each part of
which is connected with the next, giving the idea of continuity
in the measures. Each of these graphic forms has its ad- 
vantages; the histogram in case heights of rectangles are to 
be accurately compared; and the frequency polygon if the 
idea of continuity is desirable. Note that in drawing the 
frequency polygon points a and c are connected and not
points b and c.


CHART I

Histogram showing maximum temperatures for days from
July 1-Aug. 31, 1917, New York City

[Chart not reproduced. Abscissa: Temperatures; ordinate: number of days.]

CHART II

Frequency Polygon for same data

[Chart not reproduced. Abscissa: Temperatures; ordinate: number of days.]

Great care should be taken to insure that the graph agrees 
with the labels of the coördinates. Note that the class index
“65” designates the mid-point of the interval, the lower limit 
of which is 64.5 and the upper limit 65.5; that in the polygon, 
point c is directly above the class index 65, and that in the 
histogram the class index 65 designates the mid-point of the 
horizontal dimension of the rectangle. 
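
The relation between a table of frequencies, the rectangles of a histogram, and the points of a frequency polygon may be sketched as follows. The readings are a short invented list, not the data of Table VII, and the function names are hypothetical. Closing the polygon on the base line at the class indexes adjoining the observed range is an assumed convention.

```python
from collections import Counter

def frequency_table(values):
    """Number of observations at each class index, in unit steps."""
    counts = Counter(values)
    return {x: counts[x] for x in range(min(values), max(values) + 1)}

def histogram_rectangles(table):
    """(lower limit, upper limit, height) for each class index."""
    return [(x - 0.5, x + 0.5, f) for x, f in table.items()]

def polygon_vertices(table):
    """(class index, frequency) points, closed on the base line at the
    class indexes adjoining the observed range (an assumed convention)."""
    indexes = list(table)
    points = [(x, f) for x, f in table.items()]
    return [(indexes[0] - 1, 0)] + points + [(indexes[-1] + 1, 0)]

readings = [65, 66, 66, 68, 68, 68, 69]  # invented illustrative readings
table = frequency_table(readings)

print(table)
print(histogram_rectangles(table)[0])  # rectangle centered on class index 65
print(polygon_vertices(table)[:2])     # base-line point, then the point above 65
```

The rectangle for class index 65 runs from 64.5 to 65.5, while the polygon point for the same class lies directly above 65, in agreement with the labeling described above.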




It is allowable to label the beginning and end of the interval. 
In such case the histogram or polygon would be drawn exactly 
as given and point b would be labeled “64.5” and under no
circumstances “65.”

It has become somewhat customary in educational fields 
to speak of a child as solving 10 problems in a speed test, 
meaning thereby that 10 problems were solved and the 11th
started but not finished when time was called. In plotting 
the distribution of scores the designating number, 10, has been 
placed at the beginning of the interval. No objection should 
be made to this were the numerical computations in harmony 
with this procedure, but very generally such scores have been 
treated as exactly 10.0 in calculating arithmetical averages 
with the result that the curve and the constants computed 
from the data do not agree. Not uncommonly such scores 
have been treated as 10.0 scores in calculating means and as 
10.5 scores in calculating medians, with the result that a com- 
parison of mean and median scores gives an entirely erroneous 
impression as to the skewness of the data. This faulty procedure 
has probably been followed unwittingly, but unfortunately 
with the sanction of teachers. The following is quoted 
from page 50 of the Second Year Book, Division of Educational 
Research, Los Angeles, July 1919: 


“LESSON SIX — THE ARITHMETIC MEAN 
Method of Finding the Mean 

No. PROBLEMS    No. PUPILS
     12              3          3 × 12 =  36
     11              5          5 × 11 =  55
     10              7          7 × 10 =  70
      9              4          4 ×  9 =  36
      8              2          2 ×  8 =  16
                    --                   ---
                    21                   213


213 divided by 21 equals 10.14 the mean. The median in the 
same distribution would be 10.64.” In this lesson problem 
the mean is in error if 12 implies the interval 12.0 to 13.0 and 
the median (see Section 12) is in error if it implies the interval 
11.5 to 12.5. The error here cited probably grew out of an 
error in labeling a distribution. Uniformity is needed, and 
it would be in harmony with well-nigh universal procedure in 




the physical and biological fields to consider a score of 10 as 
being also a class index, or mid-point of an interval. Should 
this lower the grade of a few million school children by one half 
a point no harm would be done and the great advantage of 
having the recorded test score measures exactly those to be 
used in calculating means, standard deviations, correlations, 
etc., and of having the recorded measures also the class indexes 
in graphs is attained. Throughout this text a score no matter 
how derived originally is uniformly to be interpreted as cover- 
ing an interval extending from half a unit below to half a unit 
above. 

The accompanying data provide a nice problem in 
plotting where the distribution is decidedly asymmetrical; 
where a part of the distribution is lacking; where the class 
intervals (i.e., range covered by successive groups) are unequal; 
and where the existence of a few excessively extreme measures 
makes it impossible to select coördinates (abscissas and ordinates) 
which satisfactorily reveal the entire distribution. 
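
The disagreement between mean and median in the Los Angeles lesson quoted earlier in this section can be checked with a short sketch. The function names and the `lower_offset` parameter are illustrative assumptions, not part of the lesson; the only data used are the five scores and their frequencies.

```python
# Lesson data quoted above: score -> number of pupils.
scores = {12: 3, 11: 5, 10: 7, 9: 4, 8: 2}


def grouped_mean(freq, lower_offset):
    """Mean when score s stands for the unit interval beginning at s + lower_offset."""
    n = sum(freq.values())
    midpoint_shift = lower_offset + 0.5  # mid-point of that unit interval
    return sum((s + midpoint_shift) * f for s, f in freq.items()) / n


def grouped_median(freq, lower_offset):
    """Median by linear interpolation inside the interval holding the middle case."""
    n = sum(freq.values())
    half = n / 2
    below = 0
    for s in sorted(freq):
        if below + freq[s] >= half:
            return s + lower_offset + (half - below) / freq[s]
        below += freq[s]
    raise ValueError("empty distribution")


# Convention A: 10 is a class index, covering 9.5 to 10.5.
mean_a = grouped_mean(scores, -0.5)
median_a = grouped_median(scores, -0.5)

# Convention B: 10 means "10 solved, 11th unfinished", covering 10.0 to 11.0.
mean_b = grouped_mean(scores, 0.0)
median_b = grouped_median(scores, 0.0)

print(round(mean_a, 2), round(median_a, 2))
print(round(mean_b, 2), round(median_b, 2))
```

Under either convention applied consistently the mean and median of these data coincide; the lesson's 10.14 and 10.64 arise only from taking the mean under one convention and the median under the other, which suggests a skewness the data do not possess.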


TABLE IX 

British Income-tax Payers, 1914 
American Consular Report, May, 1915 

      INCOME             NO. OF ASSESSMENTS
£    160 to    200            257,499
     200 to    300            237,434
     300 to    400             85,557
     400 to    500             46,063
     500 to    600             23,411
     600 to    700             13,383
     700 to    800             10,250
     800 to    900              5,779
     900 to  1,000              7,445
   1,000 to  2,000             16,363
   2,000 to  3,000              3,381
   3,000 to  4,000              1,231
   4,000 to  5,000                678
   5,000 to 10,000                882
  10,000 and over                 390
                              -------
                              709,746


TABLE X 

      INCOME             NO. OF ASSESSMENTS
£      0 to     40            150,000
      40 to     80            750,000
      80 to    120          1,680,000
     120 to    160          1,400,000
     160 to    200            400,000
     200 to    300            390,000
     300 to    400             97,000
     400 to    500             49,000
     500 to    600             24,000
     600 to    700             14,000
     700 to    800             10,000
     800 to    900              6,000
     900 to  1,000              7,000
   1,000 to  2,000             17,000
   2,000 to  3,000              3,000
   3,000 to  4,000              1,000
   4,000 to  5,000                700
   5,000 to 10,000                900
  10,000 and over                 400


Notice that the first class interval covers a range of £40 
while the next to the last extends over £5000 and that the last 
interval extends over an amount not recorded but probably 




as large as £100,000. No scale which will satisfactorily picture 
the £40 class interval will be satisfactory for a £100,000 
interval. The curve below (not the insert curve) pictures as 
much of the distribution as possible. Even with an interval 
of £1000 to a distance of one-half inch, space does not permit 
of showing the last interval. Having omitted this class it is 
necessary to make note of the fact as has been done in the 
lower right hand corner of the chart. 


CHART III 

Distribution of Incomes in Great Britain 

[Chart: large curve from American Consular Report, May, 1915; insert curve hypothetical, covering all incomes. Legible axis labels: No. of assessments in thousands; Income in Pounds.]


Since the first interval is £40, the second £100, the tenth 
£1000 and the fourteenth £5000 it is impossible to plot ordinates 
proportionate to the frequencies: 257,499; 237,434; 16,363; 
and 882; and truly picture the situation. Some account must 
be taken of the difference in size of intervals, for the ordinate 
should represent the number of cases per unit interval. Ac- 
cordingly 257,499 has been divided by the interval represented, 
40, giving 6437, the number of persons per range of £1; 237,434 




divided by 100, giving 2374, etc., which quotients are the 
heights of the ordinates representing the respective classes. 
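
The division just described can be sketched as follows for the four classes of Table IX named above. The function name `cases_per_pound` is an illustrative choice, not a term from the text.

```python
# (lower limit, upper limit, number of assessments) for four classes of Table IX.
classes = [
    (160, 200, 257_499),
    (200, 300, 237_434),
    (1_000, 2_000, 16_363),
    (5_000, 10_000, 882),
]


def cases_per_pound(lower, upper, count):
    """Height of the ordinate: number of cases per £1 of the class range."""
    return count / (upper - lower)


ordinates = [cases_per_pound(*c) for c in classes]

for (lower, upper, count), height in zip(classes, ordinates):
    print(f"£{lower} to £{upper}: {count} cases, {height:.2f} per £1")
```

The first two quotients, dropped to whole numbers, are the 6437 and 2374 given in the text. The raw frequency of the tenth class (16,363) exceeds that of several narrower classes below it, yet its ordinate is only about 16 per £1.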

The ordinates have been joined by a smooth line to empha- 
size, even more than does the frequency polygon, the idea of 
continuity. A polygon or histogram is generally to be pre- 
ferred, as it is less likely to be misleading. 

Having the data of Table IX for incomes above £160 it is 
possible to make a sufficiently close estimate of the total 
distribution of wealth in Great Britain as to suggest what the 
major features of the actual distribution would be. Let us 
therefore assume the total distribution of wealth to be as 
recorded in Table X and investigate its salient features. 

The plot of the data of Table X is given in the insert. Since 
the abscissa scale is much larger than before it has been impos- 
sible to plot the entire distribution without breaks. These 
breaks are indicated, as should always be the case, by prominent 
pairs of zig-zag lines. Note that the ordinates, which were 
obtained as before, are plotted at the mid-points of the intervals, 
e.g., there are 390,000 individuals receiving incomes from 
£200-£300, or 3900 per £1 of the range. This ordinate, 
3900, is erected at £250, the class index and also the mid- 
point of the interval. 

The shape of the curve indicates that there were more than 
3900 per £1 for incomes between £200 and £250 and less than 
this number per £ for amounts between £250 and £300. 

It may also be noted that since a curved line connects the 
points, the area lying under the curve and between £200 and 
£300 will not total exactly 390,000 as it should. In curves 
smoothed by visual inspection such inaccuracy is practically 
unavoidable. For these particular data a frequency polygon 
would be still less satisfactory as it would indicate a mode at 
£100 whereas, assuming the hypothetical data to be correct, 
the mode is somewhat above that amount. A histogram would 
give the most accurate presentation, but would be less satis- 
factory in other respects. The total area in a given histogram 
interval is accurate, but the rectangular distribution within 
the interval indicated by the histogram may be quite inaccurate 
if the interval is large. 


Section 5. THE TIME CHART; RELATIVE TIME CHART; AND 
CHART OF RATIOS 


Charts have been presented in which the ordinates were 
frequencies and the abscissas amounts in a gross score. Such 
graphs are ordinarily characterized by small frequencies at 
either end of the distribution and a single mode somewhere in 
between. If, however, frequencies are plotted as ordinates, 
and periods of time as abscissas, a different type of curve 
is found, for generally with the passage of time the function 
continues to grow or at least persist. The following data and 
chart are characteristic: 


CHART IV 
Growth in Population 

[Chart: curves for the United States, California, and other states, 1850 to 1920. One vertical axis for the United States; right hand axis: Population of States in Hundred Thousands.]


Note that the right hand axis is labeled from the bottom up. 
Simplicity and clearness can frequently be obtained by labeling 
the lines in a chart and omitting the legend. 


TABLE XI 
Population in Thousands 

              1850     1860     1870     1880     1890     1900          1910          1920
CALIF.          93      380      560      865    1,213    1,485         2,378         3,427
ORE.            13       52       91      175      318      414           673           783
[WASH.]                  12       24       75      357    [illegible]   [illegible]   [illegible]
ENTIRE U.S. 23,192   31,443   38,558   50,156   62,948   75,995        91,972       105,711


The graph shown illustrates the use of a single set of abscissas 
and two sets of ordinates for the plotting of two kinds 
of curves upon the same chart; (1) population of the United 
States in millions and (2) population of States in hundred 
thousands. This method is usually very misleading and the 
present illustration is no exception. Double ordinate charts 
can be used with less error if, going with changes in time 
there are changes in the general direction of the curve, i.e., 
if it rises and falls, for then if a second curve also showing such 
fluctuations in direction of trend is plotted on the same chart 
it is possible to compare the one with the other as to direction 
of fluctuation, but it is not possible at all accurately to com- 
pare them as to magnitude of fluctuation. The method should 
be used with very great parsimony and precaution. 

For the chart shown the comparisons which can validly be 
made are those of absolute growth between state and state. 
The curve for the entire United States confuses rather than 
helps in the comparison. Absolute growth in the United 
States cannot be compared with absolute growth in the states 
as the scale is 1/50 that used for the states. Relative growth 
in the United States and in the states cannot be determined 
by comparing the slopes of the curves; e.g., the slope of 
the curve for the United States between 1900 and 1910 is 
steeper than that for Oregon for the same years, but the 
percentage growth for that period for the United States is 
21, i.e., (91,972 − 75,995) / 75,995 × 100, which is less than the 
percentage growth for Oregon, 63, i.e., (673 − 414) / 414 × 100. 
Likewise it is apparent that relative growth of state and state 
is not shown by these graphs. 
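
The comparison of slope with percentage growth may be verified by a small sketch. The populations are those of Table XI in thousands; the Oregon figure for 1910 is taken here as 673, an assumption chosen to agree with the 63 per cent stated in the text, and the factor 50 is the ratio of the two vertical scales given above.

```python
def percent_growth(start, end):
    """Growth over the period as a percentage of the starting value."""
    return (end - start) / start * 100


us_1900, us_1910 = 75_995, 91_972      # thousands, Table XI
oregon_1900, oregon_1910 = 414, 673    # thousands; 673 is an assumed reading

us_pct = percent_growth(us_1900, us_1910)
oregon_pct = percent_growth(oregon_1900, oregon_1910)

# Rise of each curve on the page, both expressed in units of the state scale.
# The United States curve is drawn on a scale 1/50 that of the states.
us_rise_on_chart = (us_1910 - us_1900) / 50
oregon_rise_on_chart = oregon_1910 - oregon_1900

print(round(us_pct), round(oregon_pct))
print(us_rise_on_chart > oregon_rise_on_chart)
```

The United States curve rises more on the page over the decade although its percentage growth is about one third that of Oregon, which is the confusion the double ordinate chart invites.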


The Relative Time Chart 


Relative growth could be shown by plotting the populations 
for the several years in terms of some one year as a base, or 
“relative.” For the data in hand this would be unsatisfactory 
for no matter what year is taken as the relative (e.g., 1850, .. . 
1910, 1920) the resulting graph would be difficult of accurate 
and significant interpretation. If change over a short period 
only is under consideration, relative curves reveal significant 
tendencies, especially if the measures, in particular the base 
measure, are large with respect to fluctuations. 

The following data permit of portrayal in graphs, either in 
terms of original scores or as ratios. 


TABLE XII 
Chicago Data * 

        U.S. ENTIRE      AV. YEARLY        UNION WAGE PER HOUR
YEAR    DUNN'S WHOLE-    RETAIL PRICE    Painters   Linotype    Carpenters
        SALE PRICE       ROUND STEAK                Operators
        INDEX
1907    107.264          14.3¢           50¢        50¢         56.3¢
1908    113.282          14.9            50         50          56.3
1909    111.848          15.9            55         50          56.3
1910    123.434          16.2            60         50          60
1911    115.102          15.9            60         50          60
1912    123.438          19.1            60         50          65
1913    120.832          20.2            65         50          65
1914    124.528          22.3            70         50          65
1915    124.168          21.2            70         50          65
1916    137.666          22.6            70         50          70


* U.S. Dept. of Labor, Bur. of Labor Statistics. Union Scale of Wages and Hours of 
Labor, 1916. 


Chart V is a graph of the data of Table XII and Chart VI 
of Table XIla. In Chart V there are various breaks in the 
vertical scales permitting the use of three different sets of 
values. The location of the word ‘‘Date” in Chart VI is 
preferable to that in Chart V. 


TABLE XIIa 
(Prices and wages expressed as ratios,* 1907 as base) 
Chicago Data 

        DUNN'S       AVERAGE YEARLY     UNION WAGE PER HOUR              RELATIVE RETAIL
YEAR    WHOLESALE    RETAIL PRICE     Painters  Linotype   Carpenters    PRICE ON 22
        PRICE INDEX  ROUND STEAK                Operators                ARTICLES OF FOOD
1907    100          100              100       100        100           100
1908    106          104              100       100        100           105
1909    104          111              110       100        100           109
1910    115          113              120       100        107           113
1911    107          111              120       100        107           113
1912    115          134              120       100        115           121
1913    113          141              130       100        115           120
1914    116          156              140       100        115           124
1915    116          148              140       100        115           124
1916    128          158              140       100        124           138


* The decimal point is omitted, as usual, so that a ratio of “106” means a six per cent 
increase. 
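
The ratios of Table XIIa can be reproduced from the gross figures of Table XII, as in the sketch below. The function name `relatives` and its `base_position` parameter are illustrative; three of the five columns are used.

```python
years = list(range(1907, 1917))

# Gross figures from Table XII.
dunn_index = [107.264, 113.282, 111.848, 123.434, 115.102,
              123.438, 120.832, 124.528, 124.168, 137.666]
painters = [50, 50, 55, 60, 60, 60, 65, 70, 70, 70]            # cents per hour
carpenters = [56.3, 56.3, 56.3, 60, 60, 65, 65, 65, 65, 70]    # cents per hour


def relatives(series, base_position=0):
    """Express each item as a whole-number ratio of the base item (base = 100)."""
    base = series[base_position]
    return [round(100 * value / base) for value in series]


for name, series in [("Dunn's index", dunn_index),
                     ("Painters", painters),
                     ("Carpenters", carpenters)]:
    print(name, dict(zip(years, relatives(series))))
```

In 1916 painters and carpenters both receive 70 cents, yet their relatives are 140 and 124, because the carpenters began from the higher base in 1907.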


CHART V 

Increase in Wholesale Prices, Retail Prices, and Wages 

[Chart: gross figures of Table XII, 1907 to 1916, with curves labeled Wholesale and Retail Steak among others.]


CHART VI 

Increase in Wholesale Prices, Retail Prices & Wages, Relative to 1907 

[Chart: ratios on 1907 as a base, 1907 to 1916. Legend: Retail, Steak; Retail, 22 Articles; Wages, Linotype Op.; Wages, Carpenters; Wages, Painters.]


Neither of the accompanying graphic presentations is with- 
out serious drawbacks. From Chart V it is possible to infer 
that the retail price of round steak and wholesale prices of 
food products both dropped from 1910 to 1911 but it is not 
possible to judge which suffered the greatest relative decline. 
Chart VI does show that relative to 1907 wholesale prices 
suffered most. 

Chart VI gives the impression that painters are better off 
than carpenters, — relative to condition in 1907 they are, 
but in no other sense as Table XII shows. A relative table 
or chart shows facts relative to condition at date of base and 
nothing else, which is a point that must be stressed or it will 
be overlooked by the untrained reader. A gross measure- 
ment table, or chart, reveals gross changes and directions of 
relative changes but not the magnitude of relative changes. 

Another inaccuracy which is commonly present in ratio 
measures and accordingly in charts based upon them, is due 




to the fact that variations in ratios are frequently large with 
respect to the base used. Prices may increase or cities grow 
100, 200, ... 1000 per cent, but it is impossible for them to 
decrease by such amounts. A change in ratio from 50 to 100 
means more than a change from 100 to 150 though they show 
up the same when plotted. Similarly in terms of genuine 
significance, to pass from a ratio of 20 to one of 30 is greater than 
to pass from one of 30 to one of 40. 
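
One way to see why equal distances on a ratio chart may differ in significance is to set the proportional change beside the plotted difference. The sketch below uses the four changes named in the paragraph above; the function name is illustrative, and the proportional change is offered as one reasonable reading of "genuine significance", not as the text's own measure.

```python
def proportional_change(old, new):
    """New value as a multiple of the old value."""
    return new / old


pairs = [(50, 100), (100, 150), (20, 30), (30, 40)]

for old, new in pairs:
    plotted_difference = new - old
    print(old, new, plotted_difference, round(proportional_change(old, new), 3))
```

The first two changes plot as the same vertical distance of 50, but the first is a doubling and the second an increase of one half; likewise 20 to 30 is an increase of one half while 30 to 40 is an increase of one third.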

To illustrate certain of the tricky features to be guarded 
against in the use of ratios the following data and graphs are 
given: 


CHART VII 
Percentage of Male Teachers in the High Schools, 1900 to 1910 
[Ratio chart.]

CHART VIIa 
Number of Male Teachers in the High Schools, 1900 to 1910 
[Gross frequency chart.]

TABLE XIII 

Number of Teachers in the Public High Schools of the U.S. 
Report of the Commissioner of Education, 1913, v. 2, pp. 9-10 

                        1900                                 1910
MEN       10,172 = 50 per cent of total        18,890 = 45 per cent of total
                   (49.931 more exactly)                 (45.336 more exactly)
WOMEN     10,200 = 50 per cent of total        22,777 = 55 per cent of total
TOTALS    20,372                               41,667




From a casual glance at these charts it would be hard to 
realize that they are both accurate representations of the 
same data. A few pertinent questions might be asked: 

(1) If the tendency shown by the ratio chart (tendency 
based upon the actual data for 1900 and 1910) continues, 
what will be the proportion of male teachers in the year 2000? 
Answer .03981. 

(2) If the tendency shown by the gross frequency chart 
(tendency based upon the same actual data) continues, how 
many male high school teachers will there be in the year 2000? 
Answer 97,352. 

(3) With the proportion as shown in your answer to ques- 
tion (1) and the number of male teachers as given in your 
answer to question (2), how many women teachers would there 
be in the high schools in the year 2000? Answer 2,348,064. 
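
The three answers follow from straight-line extrapolation of the two charts, which may be sketched as below. The helper `linear_extrapolate` is an illustrative name; the proportions are those of Table XIII carried to five decimals.

```python
def linear_extrapolate(x0, y0, x1, y1, x):
    """Value at x on the straight line through (x0, y0) and (x1, y1)."""
    slope = (y1 - y0) / (x1 - x0)
    return y0 + slope * (x - x0)


men_1900, men_1910 = 10_172, 18_890
prop_1900, prop_1910 = 0.49931, 0.45336   # proportion of teachers who are men

# Question (1): continue the ratio chart to the year 2000.
prop_2000 = linear_extrapolate(1900, prop_1900, 1910, prop_1910, 2000)

# Question (2): continue the gross frequency chart to the year 2000.
men_2000 = linear_extrapolate(1900, men_1900, 1910, men_1910, 2000)

# Question (3): women implied by combining the two extrapolations.
women_2000 = men_2000 / prop_2000 - men_2000

print(round(prop_2000, 5), round(men_2000), round(women_2000))
```

Each extrapolation taken alone looks moderate; it is only in combining them that the absurdity appears, since the two charts have silently assumed different things about the growth of the total.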

If the reader sees through this situation he appreciates one 
of the fallacies likely to arise through the use of proportions. 
Another occurs in combining ratios. 


Time Ratios 

To average a number of ratios to obtain a single index, in 
general leads to an error. This will be considered later, but 
to illustrate the fact that ratios do not group themselves in 
a symmetrical manner around their own mean, the following 
data from Mitchell are given as quoted by Secrist. (1917, 
p. 312.) They also provided the material for an important 
problem in plotting. 

It will be noticed that the class intervals extend over ranges 
of two units, e.g., there are five class intervals in covering a rise 
in prices from 10 per cent to (but not including) 20 per cent. 
With no direction to the contrary it is to be presumed that the 
class designated in the table by “54-55.9” includes all 
measures with values between the limits 53.95 and 55.95; 
that the next class includes measures between 51.95 and 53.95; 
etc. This is to say that presumably the data have been recorded 
to but one decimal place so that such measures as 53.86 and 
53.92 are called 53.9 and a measure such as 53.96 is recorded 
as 54.0. If the recorder encountered a measure 53.95 he had 
to arbitrarily decide whether it would be called 53.9 or 54.0. 


TABLE XIV 

Distribution of 5578 Cases of Change in the Wholesale Prices of Commodities 
from One Year to the Next 

PER CENT OF CHANGE                    PER CENT OF CHANGE
FROM THE AVERAGE       NUMBER OF      FROM THE AVERAGE       NUMBER OF
PRICE OF THE           CASES          PRICE OF THE           CASES
PRECEDING YEAR                        PRECEDING YEAR

(FALLING PRICES)                      (RISING PRICES)
  54-55.9                  1            14-15.9                106
  ...                                   16-17.9                102
  50-51.9                  1            18-19.9                 73
  48-49.9                  1            20-21.9                 65
  46-47.9                  1            22-23.9                 45
  44-45.9                  2            24-25.9                 47
  42-43.9                  4            26-27.9                 29
  40-41.9                  5            28-29.9                 30
  38-39.9                  5            30-31.9                 22
  36-37.9                  7            32-33.9                 17
  34-35.9                 10            34-35.9                 18
  32-33.9                  7            36-37.9                 11
  30-31.9                 16            38-39.9                 17
  28-29.9                 27            40-41.9                 14
  26-27.9                 17            42-43.9                  6
  24-25.9                 32            44-45.9                 10
  22-23.9                 39            46-47.9                 11
  20-21.9                 45            48-49.9                  5
  18-19.9                 71            50-51.9                  1
  16-17.9                 76            52-53.9                  4
  14-15.9                107            54-55.9                  3
  12-13.9                120            56-57.9                  1
  10-11.9                173            58-59.9                  6
   8- 9.9                200            60-61.9                  4
   6- 7.9                238            ...
   4- 5.9                329            66-67.9                  4
   2- 3.9                375            68-69.9                  3
  Under 2                405            70-71.9                  1
  No change              697            72-73.9                  4
(RISING PRICES)                         74-75.9                  1
  Under 2                410            ...
   2- 3.9                355            80-81.9                  1
   4- 5.9                356            82-83.9                  1
   6- 7.9                261            84-85.9                  1
   8- 9.9                237            86-87.9                  1
  10-11.9                167            ...
  12-13.9                115           100-101.9                 1
                                       102-103.9                 1
                                                             -----
                                                             5,578


For the data in hand it is not known how such a case would 
have been decided, but a very good rule to follow is to always 
assign such a critical measure to the even instead of the odd 
value, i.e., the measures 53.95, 54.05, 54.15, 54.25, 54.35 and 
54.45 would be assigned as 54.0, 54.0, 54.2, 54.2, 54.4 and 54.4 
respectively. It will be noticed that in the long run this 
introduces no systematic error for the ½ is thrown away as 
often as it is added. It does result in a slight piling up of the 
even measures, but that is generally inconsequential, whereas 
the adding of a half every few measures would result in a 
cumulative error which might be serious. 
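
The rule of assigning a critical measure to the even value corresponds to what Python's `decimal` module calls `ROUND_HALF_EVEN`. The sketch below applies it to the six measures just listed; the function name is illustrative, and the measures are passed as text so that they are held exactly as decimals.

```python
from decimal import Decimal, ROUND_HALF_EVEN


def round_to_even_tenth(text):
    """Round a decimal string to one place, sending exact halves to the even digit."""
    return Decimal(text).quantize(Decimal("0.1"), rounding=ROUND_HALF_EVEN)


critical = ["53.95", "54.05", "54.15", "54.25", "54.35", "54.45"]
rounded = [round_to_even_tenth(value) for value in critical]

# Net amount added by the rounding over the six measures.
net_error = sum(rounded) - sum(Decimal(value) for value in critical)

print([str(value) for value in rounded])
print(net_error)
```

Over these six measures the half is added three times and thrown away three times, so the net error is zero, while the recorded values fall only on 54.0, 54.2 and 54.4.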

If the class intervals run in order from 53.95 to 55.95, 51.95 
to 53.95, ... 1.95 to 3.95 it is found that the next frequency, 
in order to extend over the same range, would be from −.05 to 
1.95, i.e., from an increase in price of .05 per cent to a decrease 
of 1.95 per cent. This, however, cannot be the case, as a very 
large frequency, 697, is recorded for “no change.” The way 
the data are recorded would suggest a class interval corresponding 
to “no change,” but this cannot be so, as the intervals on 
either side preëmpt the space. In plotting the data, therefore, 
the “no change” interval must be squeezed out and its 
frequency, 697, distributed between the neighboring classes. We 
will assign 348 to the “under 2, Falling prices” interval, and 
the remainder, 349, to the “under 2, Rising prices” interval. 
There still is a slight discrepancy (.05) in the ranges of these 
two middle intervals, but as it cannot be positively accounted 
for without recourse to the original data it is passed over. 

For convenience in tabulation and plotting we will consider 
the first class interval to extend from 54.00 to 56.00 and to 
have its mid-point or class symbol 55.00, the second a mid- 
point at 53.00, etc., and the frequencies as before. 
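
The redistribution of the "no change" cases and the assignment of mid-points can be sketched as follows. The frequencies 405, 697 and 410 are those of the three central lines of Table XIV; the variable and function names are illustrative.

```python
falling_under_2, no_change, rising_under_2 = 405, 697, 410

# Divide the "no change" cases between the two neighboring classes.
to_falling = no_change // 2            # 348
to_rising = no_change - to_falling     # 349, the remainder

falling_class = falling_under_2 + to_falling   # class from -2.00 to 0.00
rising_class = rising_under_2 + to_rising      # class from 0.00 to 2.00


def class_midpoint(lower, upper):
    """Class index taken as the mid-point of the interval."""
    return (lower + upper) / 2


print(class_midpoint(-2.0, 0.0), falling_class)
print(class_midpoint(0.0, 2.0), rising_class)

# Regrouping in intervals of 4 per cent joins these two classes into one
# class centred on zero.
print(class_midpoint(-2.0, 2.0), falling_class + rising_class)
```

The two central classes thus carry 753 and 759 cases at mid-points of −1 and +1, and their union, 1512 cases at a mid-point of 0, is the central class when the data are regrouped in intervals of 4 per cent.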

The frequency polygon seems better suited to the data in 
hand, as it gives the impression of a more pronounced mode 
than would a histogram and in this case this feature should 
be emphasized. 

Three ways of connecting the points of a distribution have 
been presented: (a) by drawing a histogram (Chart I); 
(b) by drawing a frequency polygon (Chart II); (c) by drawing 
a smooth curve through or near all the points which fits 
the data as nearly as can be determined visually (Chart 
III). A fourth way (d) is to plot from smoothed data; and a 
fifth (e) is by mathematically determining the equation of 
the curve which best fits the data and plotting the same. 
This last method is discussed in Chapter VII. Methods (a), 
(b), and (e) preserve areas, i.e., the total area under the curve 
is equal to the population, or number of cases. Method (e) 
also preserves other important features. In using method 
(c) there should be a definite attempt to preserve areas; that is, 


TABLE XV 

Per cent of change from the average price of the preceding year: 
number of cases in class intervals of 2 per cent (by mid-point), and 
the same cases in class intervals of 4 per cent 

   CLASSES OF 2 PER CENT                     CLASS INTERVAL OF 4 PER CENT
 MID-POINT  CASES    MID-POINT  CASES        MID-POINT     CASES
    -57        0        -55        1            -56            1
    -53        0        -51        1            -52            1
    -49        1        -47        1            -48            2
    -45        2        -43        4            -44            6
    -41        5        -39        5            -40           10
    -37        7        -35       10            -36           17
    -33        7        -31       16            -32           23
    -29       27        -27       17            -28           44
    -25       32        -23       39            -24           71
    -21       45        -19       71            -20          116
    -17       76        -15      107            -16          183
    -13      120        -11      173            -12          293
     -9      200         -7      238             -8          438
     -5      329         -3      375             -4          704
     -1      753          1      759              0        1,512
      3      355          5      356              4          711
      7      261          9      237              8          498
     11      167         13      115             12          282
     15      106         17      102             16          208
     19       73         21       65             20          138
     23       45         25       47             24           92
     27       29         29       30             28           59
     31       22         33       17             32           39
     35       18         37       11             36           29
     39       17         41       14             40           31
     43        6         45       10             44           16
     47       11         49        5             48           16
     51        1         53        4             52            5
     55        3         57        1             56            4
     59        6         61        4             60           10
     63        0         65        0             64            0
     67        4         69        3             68            7
     71        1         73        4             72            5
     75        1         77        0             76            1
     79        0         81        1             80            1
     83        1         85        1             84            2
     87        1         89        0             88            1
     91        0         93        0             92            0
     95        0         97        0             96            0
     99        0        101        1            100            1
    103        1        105        0            104            1
                               -----                       -----
                               5,578                       5,578
CHART VIII 

Change in Wholesale Prices from One Year to Next 

[Chart: frequency polygon. Abscissa: Percentage of Change from Average Price of Preceding Year.]



if the curve as drawn lies above any point it should lie below 
some other, or, more accurately, the sum of the vertical distances 
which it lies above points in the actual distribution should 
equal the sum of the distances which it lies below other points. 
In drawing a free curve for incomes, Chart III, the preserva- 
tion of total area is a difficult thing to insure, but for maximum 
temperatures, Chart I, it can be accomplished with fair accuracy 
and little trouble. The personal element which enters into 
method (c) generally makes it inadvisable for published work; 
but for original, hasty and personal research it may well be 
the one most frequently used. 


Section 6. SMOOTHING DATA 
The smoothing of data preparatory to plotting (Method d) 
may be illustrated by the accompanying records of the U. S. 
Weather Bureau for New York City: 


TABLE XVI 
Mean Monthly Temperatures for 1917 


Jan. Feb. Mar. Apr. May June July Aug. Sept. Oct. Nov. Dec. 
B2Aeee -OMNSO. fa A120 O-2) 100-3) 74-7 4.0) 03-05 52-ONN AIL 225-0 


We have here a temporal series, and as is frequently the 
case, periodic fluctuations are shown. To obtain a general 
idea of variations within the year the curve at the end of 
December should join on to the curve at the beginning of 
January, as indicated below in Chart IX drawn by Method (c). 

It will be noticed that in the 1917 data there is a minor mode 
in January and a major mode in August. As such bi-modality 
is not typical we will smooth by means of the moving average 
method and plot the resulting series. The moving average 
method consists of replacing original items by averages of a 
certain number of class frequencies. In the present problem 
we will average the frequencies for two neighboring class 
intervals and assign the result to the point midway between 
the two frequencies. If we consider the averages for each 
month as belonging to the 15th day of the month, we can 
take the average of the temperatures for January and February 
and assign this average to the end of January or the first of 
February. Next the February and March temperatures are 
averaged and the result assigned to March 1. Continuing 


throughout the series, finally averaging the temperatures for 
December and January, gives the data of Table XVI-a, 
indicated on Chart IX by the x’s. 


CHART IX 
Mean Monthly Temperatures, N.Y. City, Jan.-Dec. 

[Chart: curve for 1917 with smoothed points marked by x’s; dotted line, average of 47 years. Ordinate: temperature; abscissa: months.]


TABLE XVI-a 


Mean Smoothed Temperatures for 1917 

[nie Pots ING ae Elec Ne rei ik Ys ce (Oka ING Oki 
28. 7m BOstn 333200 43.0 50.2 00:80) 71.2. 74.47 768.3) 57-5) 40-0 me oul 

The reason this process is called that of taking a “moving 
average” would be better exemplified if groups of three or 
more items were averaged, in which case each successive sum 
is obtained from the preceding one by dropping one item and 
adding a second. It will be noticed that this curve has but a 
single mode, is much more regular than the curve from the 
original data, and does not have as high a maximum or as low 
a minimum, which fact is a necessary consequence of the 
method of smoothing. Moreover, it represents the annual 
fluctuations better than the curve from the original data, as is 
shown by comparing it with the dotted line based upon the 
records for the 47 years from 1871-1917, given herewith: 


TABLE XVII 

Jan.   Feb.   Mar.   Apr.   May    June   July   Aug.   Sept.  Oct.   Nov.   Dec. 
31.0   30.5   37.8   48.7   59.8   68.8   74.0   72.6   66.4   55.7   43.9   34.0 

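
The two-month moving average described above may be sketched as follows. For illustration it is applied to the 47-year averages of Table XVII and not to the 1917 record; the function name is an assumption, and the series is treated as cyclic so that December is averaged with January.

```python
months = ["Jan.", "Feb.", "Mar.", "Apr.", "May", "June",
          "July", "Aug.", "Sept.", "Oct.", "Nov.", "Dec."]
averages_47_years = [31.0, 30.5, 37.8, 48.7, 59.8, 68.8,
                     74.0, 72.6, 66.4, 55.7, 43.9, 34.0]


def cyclic_pair_average(values):
    """Average each item with its successor, the last item with the first."""
    n = len(values)
    return [(values[i] + values[(i + 1) % n]) / 2 for i in range(n)]


smoothed = cyclic_pair_average(averages_47_years)

# smoothed[i] belongs to the first day of the month following months[i];
# the last entry, from December and January, belongs to January 1.
for i, value in enumerate(smoothed):
    print(f"{months[(i + 1) % 12]} 1: {value:.2f}")
```

The highest smoothed value falls below the highest monthly average and the lowest smoothed value lies above the lowest monthly average, as the averaging of unequal neighbors requires.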
Since the average of two unequal numbers is never as large 
as the larger or as small as the smaller of the two, the smoothing 




process tends to flatten a curve out and lower modes. If the 
data are particularly irregular it is frequently desirable to do 
this to reveal a general trend, but it should be borne in mind 
that something of significance is always lost in the process of 
smoothing. Numerical calculations should never be made from 


TABLE XVIII 


Distribution of Marks given to Women in 8 Elective College Subjects. Below 
60 Failure; 60-74 Condition 


(From Mary Theodora Whitley, A Statistical Study of College Marks, 
Master’s Dissertation, Columbia, 1906) 
GRADE] (FRE- |Av. oF| Av. oF | Av. oF ||GRADE! (FRE- | Av. oF | Av. oF]! Av. OF 
QUENCY)|THREE| FIVE | FIFTEEN QUENCY)| THREE Five | FIFTEEN 
43 .07 75 27 mane Oak || Thiet 
44 -06 76 i 15.0 12.4 13.47 
45 | 07 77 II 11.3 15.4 | 14.33 
46 .06 78 16 14.3 21.0 18.00 
47 .07 79 16 29.0 20.0 20.00 
48 2 BL 80 55 24.3 20.4 | 22.80 
49 EG 2 all 81 2 223A el 2A Ome 20107, 
50 I 4 22 03 82 1 78, Zia E Ss 
51 3 Z -14 83 27 21.3 PAs) || BALOY 
52 2 alg} 84 14 39.7 32,0932200 
53 2 53 85 68 37-7 38.057 3am 
54 2 2 54 86 31 Nef —|| ANE) | eh7ecK0x0) 
55 i -4 2 53 87 43 41.3 43-0 | 37-07 
56 533 2 53 88 50 B3u7, 48.4 | 37.60 
57 2 54 89 23 56.0 43.4 | 39.00 
58 v2 93 90 95 41.3 45.2 38.87 
59 230 ee 93 gI 6 51.0 Adee ale sOus 
60 GS || ase tee, 1.17 92 52 36.0 44.0 | 35.87 
61 2.0 1.2 1.20 93 50 39.7 Bye Omlesuess 
62 12 127) 94 i07/ 43.3 41.0 | 29.26 
63 1.4 2.07 95 63 34.4 Pex) || FOL) 
64. DB 1.4 DG 96 23 32.3 Pesos || 220%) 
65 pM Bae 1.8 2.20 97 if 13.0 20,052.53) 
66 3.0 DP PG 98 5 5.7 8.0 | 15.20 
67 2 1.3 2.4 2.33 99 I 2.0 3.4 | 14.80 
68 2) 171 3.6 BaG 100 3 2) 11.33 
69 I 5.3 3.8 4.20 IOI i 8.00 
70 13 5.0 3.6 4.93 102 6.87 
71 5.0 3.4 6.00 || 103 2.67 
WZ I 1.0 3.4 FeO7 104 | TE 
AB I 1.0 G2 || WKoler7/ 105 -40 
74 I 9.7 7.4 | 10.40 106 Oy 
Hie || Wkex Tid Nid 


smoothed data, as a spurious consistency in the findings may be 
introduced and significance of the original data may be hidden. 




The possibilities and limitations of smoothing will be better 
illustrated by application to the data of Table XVIII which 
are decidedly multi-modal. 

In the accompanying Chart X, the histogram represents 
the original data; the smoothed average-of-three curve is not 


CHART X 

Distribution of School Grades 

[Chart legend: solid line, graph from original data; broken line, graph from smoothed data, average of 5; circles, graph from smoothed data, average of 15. Ordinate: Number Receiving Grade Indicated; abscissa: Grades Received.]


shown; the ordinates of the smoothed average-of-five curve 
are represented by dots; and the ordinates of the smoothed 
average-of-fifteen curve are represented by o’s. 

The curve from the original data has fourteen modes, ten 
of them located at grades divisible by five and four located 
halfway between such grades. It seems that many teachers 
do not grade on a percentile scale in units smaller than five 
per cent, and that most of the remainder do not grade in units 
less than two and one half per cent. An examination of the 
frequencies in the average-of-three column shows that these 
minor modes, which occurred about every 2½ units, have been 




smoothed out by the process of averaging three neighboring 
measures, but that all the major modes persist though they 
occasionally are no longer exactly five units apart. It is 
found, by reference to the plotted distributions, that it requires 
the smoothed average-of-five curve (----) to smooth out 
the modes periodically occurring every five units. It is also 
apparent that the smoothed average-of-fifteen curve has 
flattened the mode at 90 and spread out the extreme measures 
altogether too much. It is therefore a desirable rule, when 
smoothing must be resorted to, to average such a number of 
neighboring groups as just cover the periodicity which it is 
desired to smooth out. If the data show great irregularity, 
rather than periodicity, it is better to average too small a num- 
ber of groups than too large a number. In the case in hand 
there is no doubt that the smoothing by averaging five class 
frequencies is the preferable method, but even so, something 
of significance, as is always the case, has been lost by the 
smoothing. To illustrate: the percentage of failures shown by 
the smoothed data, .57 per cent, is over twice as large as was 
in reality the case, .26 per cent. 
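
The rule of averaging just enough neighboring groups to cover the periodicity can be illustrated on invented data. The frequencies below are hypothetical, with heaping at every grade divisible by five; they are not the marks of Table XVIII. The function forms each successive sum from the preceding one by dropping one item and adding the next.

```python
def moving_average(values, window):
    """Moving average of an odd number of neighboring items, by running sum."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    if len(values) < window:
        return []
    running = sum(values[:window])
    averages = [running / window]
    for i in range(window, len(values)):
        running += values[i] - values[i - window]   # add one item, drop one
        averages.append(running / window)
    return averages


# Hypothetical frequencies for grades 60 to 100, heaped on multiples of five.
grades = list(range(60, 101))
frequencies = [10 if grade % 5 == 0 else 2 for grade in grades]

average_of_three = moving_average(frequencies, 3)
average_of_five = moving_average(frequencies, 5)

print(sorted(set(average_of_three)))
print(sorted(set(average_of_five)))
```

Averaging three items still leaves two distinct levels, so the five-unit periodicity persists; averaging five items, a span which just covers the period, gives a single level throughout.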


Section 7. THE OGIVE CURVE 


When it is desired to determine the number of cases or per 
cent of the population lying below a certain record, it can be 
readily done if a curve is plotted showing sums of the fre- 
quencies of all measures below designated amounts of the 
trait. The method may be illustrated by the data of Table I. 
The first two columns below repeat that table; the third column 
is obtained by cumulating the frequencies in column two. 
The 1 in column three recorded opposite 65.5 means that one 
day (out of the 62) had a temperature less than 65.5. It will 
be noticed that two days had temperatures less than 66.5, 
or 67.5, or 68.5, or 69.5. In such a case it is sounder to assign 
the 2 to the point midway between the 65.5 and the 69.5 than 
to any other point in this stretch. Accordingly it is recorded 
in column three that 2 days had temperatures less than 68.0. 
Continuing there are 3 days with temperatures less than 70.5; 
4 with less than 72.5, etc. Finally it is to be noted that the 
last point is indeterminate, i.e., 62 days had temperatures 
less than 98.5, or 99.5, or 100.5, etc. It is impossible to determine 
from finite data what is the maximum temperature below 
which the temperatures for all days lie. It is of course also 
impossible to determine what is the minimum temperature 
above which the temperatures for all days lie. For this 


TABLE XIX

Distribution of Daily Maximum Temperatures, July and August,
New York City, 1917

                 NO. OF DAYS       CUMULATED       CUMULATIONS
TEMPERATURES     WITH GIVEN        NO. OF DAYS     EXPRESSED IN
                 TEMPERATURE                       PERCENTAGES
 65                  1
 65.5                                   1               1.6
 66                  1
 68                                     2               3.2
 70                  1
 70.5                                   3               4.8
 71                  1
 72.5                                   4               6.4
 74                  2
 74.5                                   6               9.7
 75                  3
 75.5                                   9              14.5
 76                  1
 76.5                                  10              16.1
 77                  1
 77.5                                  11              17.7
 78                  3
 78.5                                  14              22.6
 79                  1
 79.5                                  15              24.2
 80                 10
 80.5                                  25              40.3
 81                  8
 81.5                                  33              53.2
 82                  5
 82.5                                  38              61.3
 83                  7
 83.5                                  45              72.6
 84                  2
 84.5                                  47              75.8
 85                  4
 85.5                                  51              82.3
 86                  3
 86.5                                  54              87.1
 87                  1
 87.5                                  55              88.7
 88                  2
 89                                    57              91.9
 90                  1
 92.5                                  58              93.6
 95                  1
 95.5                                  59              95.2
 96                  1
 97                                    60              96.8
 98                  2
 98.5                                  62?
 99.5                                  62?
100.5                                  62?


reason the zero and one hundred percentile points for this 
ogive curve are not plotted. This should be the case for all 
ogive curves, the common practice of plotting the lowest
and highest recorded data as the 0 and 100 percentiles being
inaccurate and confusing.

Column four gives the same data as column three, expressed 
in percentages of the total frequency. In the accompanying 
graph the ordinates are the cumulative frequencies in per- 
centages and the abscissas are the temperatures as shown: 


CHART XI

Daily Maximum Temperatures
July and August 1917, New York City

[Figure: ogive curve. Axes: maximum temperature and per cent of days falling short of given temperature.]


It is interesting to note that the relatively irregular data 
used has resulted in a fairly regular ogive curve, and that, 
without any smoothing. The ogive curve facilitates interpre- 
tation, e.g., it is immediately read from the curve that: 


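 5 per cent of the days do not attain a temperature of [illegible]
10 per cent of the days do not attain a temperature of 75°
20 per cent of the days do not attain a temperature of [illegible]
25 per cent of the days do not attain a temperature of [illegible]
50 per cent of the days do not attain a temperature of 81°
90 per cent of the days do not attain a temperature of 88°
95 per cent of the days do not attain a temperature of 95°
[illegible] per cent have maximum temperatures between [illegible], etc.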
Or, interpolating the other way: 


A temperature of 95 or more is reached on 5 per cent of the days, etc.


The ogive curve may also be used to determine the mode, 
for if a smooth curve (not a polygon as here shown) is drawn 
through or near the points given and a ruler rotated so as to 
be tangent to the curve at successive points, that point at
which the ruler ceases turning in one direction and starts to
turn in the other (called the point of inflection) is the modal
point, its value being read from the ordinate measures on the
margin. Applying this method to these particular data the
mode is found to be very close to 81°. The more important
measures revealed by the curve are the median, or 50-percentile;
the lower and upper quartiles; the semi-interquartile range, more
briefly called the quartile deviation, or one half the distance
between the upper and lower quartiles; the 10-percentile; the
90-percentile; and the 10-90-percentile range. For the data in
hand these are respectively 81°, 79.5°, 84.5°, 2.5°, 75°, 88° and 13°.
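
The construction of the ogive points and the reading of percentiles from them may be sketched in Python as below. This is only an illustration under stated assumptions: the function names `ogive_points` and `percentile` are hypothetical, the frequencies are those of the temperature table, and percentiles are found by straight-line interpolation along the polygon, so the values will be close to, but not necessarily identical with, readings taken by eye from a smoothly drawn curve.

```python
from itertools import accumulate

# Daily maximum temperatures and number of days, as in the temperature table.
freq = {65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3,
        79: 1, 80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1,
        88: 2, 90: 1, 95: 1, 96: 1, 98: 2}


def ogive_points(freq):
    """Return (temperature, cumulative per cent) points of the ogive.

    A cumulative count belongs to the upper limit of its class (t + 0.5).
    Where empty classes follow, the count is placed midway along the
    stretch over which it holds. The final point (100 per cent) is left
    out because its position is indeterminate.
    """
    temps = sorted(freq)
    total = sum(freq.values())
    running = list(accumulate(freq[t] for t in temps))
    points = []
    for i in range(len(temps) - 1):
        lo = temps[i] + 0.5        # first point below which running[i] days lie
        hi = temps[i + 1] - 0.5    # last such point before the count rises
        points.append(((lo + hi) / 2, 100 * running[i] / total))
    return points


def percentile(points, p):
    """Straight-line interpolation along the ogive polygon (an assumption)."""
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 <= p <= y1:
            return x0 + (x1 - x0) * (p - y0) / (y1 - y0)
    raise ValueError("percentile lies outside the plotted points")


pts = ogive_points(freq)
median = percentile(pts, 50)
quartile_deviation = (percentile(pts, 75) - percentile(pts, 25)) / 2
```

Because the zero and one hundred percentile points are not plotted, a request such as `percentile(pts, 100)` raises an error rather than returning a misleading value.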


Section 8. THE GROWTH CURVE


The accompanying table gives smoothed scores in a reasoning 
test as given by Kelley (1917). Plotted they give a typical 
growth curve. 


TABLE XX

AGE                     0   1   3   5   7   8   9  10  11  12  13  14  15  16  17  18  ADULT
SCORE ON TRABUE SCALE   0   0   1   5  11  19  31  50  62  67  71  73  76  82  87  90   94


CHART XII

Growth Curve in Reasoning Test Ability

[Figure: growth curve. Axes: age and test score.]




This particular curve is interesting in that it shows a flattening
at ages 13 and 14, which is not at all characteristic of growth
curves of mental traits. As the units of measurement, rather
than intrinsic ability, could conceivably account for the
phenomenon, the curve does not prove, but merely suggests,
that there is a pubertal disturbance. For the purpose of the
present statistical treatment no attention need be paid to the 
double inflection of the curve. 

Rotating the curve through 90° and looking at it in a mirror 
(as pictured in Chart XIII) shows its general resemblance to 
an ogive curve. It was possible in the case of daily tempera- 
tures to cumulate scores and obtain ogive curve data. By 
the reverse process it is possible from the ogive data to obtain 
the original distribution of temperatures. By parity of opera- 
tion it is possible to obtain measures of growth increments 
from an original growth curve. The growth curve may be 
plotted as herewith: 


CHART XIII

Growth Curve, Reasoning Ability
Score for Different Ages

[Figure: the growth curve rotated and reflected so that test score is the abscissa.]


Thinking of the abscissas as sums of increments of reasoning 
ability and recalling that the graph is for an average individual, 
whose maximum development or accumulation is to 94 of such 
increments (i.e., the total population of increments is 94) the 
graph may be read: At age 7 the individual possesses 11 
increments of reasoning ability; at age 10, 50 increments,
etc. This may be an awkward way of interpreting growth,
but if it is desired to think of growth as a sum of increments it
immediately suggests the determination of the increments
added during each year of life as follows:


TABLE XXI

AGE      SCORE     INCREMENT     AGE
  0         0
                       0           .5 (from 0-1)
  1         0
                       .5         1.5 (from 1-2)
                       .5         2.5 (from 2-3)
  3         1
                       2          3.5 (from 3-4)
                       2          4.5 (from 4-5)
  5         5
                       3          5.5 (from 5-6)
                       3          6.5 (from 6-7)
  7        11
                       8          7.5 (from 7-8)
  8        19
                      12          8.5, etc.
  9        31
                      19          9.5
 10        50
                      12         10.5
 11        62
                       5         11.5
 12        67
                       4         12.5
 13        71
                       3         13.5
 14        73
                       3         14.5
 15        76
                       6         15.5
 16        82
                       5         16.5
 17        87
                       3         17.5
 18        90
                       4          ? (from 18-adulthood)
Adult      94
                      94


These growth increments plotted in the form of an ordinary
frequency polygon give the following figure:

CHART XIV

Distribution of Yearly Growth Increments in a Reasoning Test

[Figure: frequency polygon. Axes: age and growth increments.]


The bi-modality of the growth increment curve is of course a 
consequence of the double inflection of the growth curve. Since 
the constants of this increment curve (mean, skewness, standard 
deviation, etc.) can be readily calculated, the curve has certain 
advantages over the growth curve. It should be a very con- 
venient form in which to present data for purposes of studying 
variability in rate of growth, variability in price changes, etc. 
In dealing with functions in which there is a loss in a given 
period, e.g., when an individual weighs less in one year than in 
the preceding, negative frequencies arise. These need cause 
no trouble if treated strictly algebraically and the negative sign 
preserved. 
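
The passage from a growth curve to its increment curve, and the calculation of constants of the latter, may be sketched in Python as follows. The sketch assumes only the scores for ages 7 through 13 given in the table of increments; the function names `increments` and `weighted_moments` are hypothetical, and each yearly increment is assumed to lie at the middle of its year.

```python
from itertools import accumulate

# Scores at ages 7 through 13, from the table of growth increments.
ages = [7, 8, 9, 10, 11, 12, 13]
scores = [11, 19, 31, 50, 62, 67, 71]


def increments(values):
    """Successive differences; a loss in any period gives a negative entry."""
    return [b - a for a, b in zip(values, values[1:])]


def weighted_moments(midpoints, weights):
    """Mean and variance of a frequency curve.

    The weights are treated strictly algebraically, so negative
    frequencies are summed with their signs preserved.
    """
    n = sum(weights)
    mean = sum(w * m for m, w in zip(midpoints, weights)) / n
    variance = sum(w * (m - mean) ** 2 for m, w in zip(midpoints, weights)) / n
    return mean, variance


gains = increments(scores)                  # growth added in each year
mid_ages = [a + 0.5 for a in ages[:-1]]     # 7.5, 8.5, ..., 12.5
mean_age, variance_age = weighted_moments(mid_ages, gains)

# The reverse process: cumulating the increments restores the growth curve.
restored = list(accumulate(gains, initial=scores[0]))
```

A series containing a loss, for instance `increments([10, 8, 9])`, simply yields a negative first entry, which enters the sums with its sign.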

Brown and Thomson (1921) have shown that the standard 
deviations of the class frequencies of such a curve are not given 
by the ordinary formula [Formula 25]. 


Section 9. THE GRAPHIC REPRESENTATION OF CATEGORICAL 
MEASURES 


The graphs thus far have pictured the frequencies or amounts 
of a quantitative or temporal variable, but if the frequencies of 
categorical measures are desired a different procedure is necessary.
For example, if it is desired to represent the number of
days which lie in the following categories, (a) clear, (b) cloudy,
(c) rainy, a large number of devices are possible. The significant
feature to be portrayed in this as in all qualitative series
is the magnitude of each category with reference to the others,
or the proportions which each bears to the whole. This may
be shown by appropriate lengths of lines constituting what is
called a “bar diagram,” by heights of shaded rectangles, by
sectors of the required number of degrees, by appropriate 
number of discrete objects, men, bushels, ships, etc. The 
essence of an accurate portrayal lies in having the representa- 
tions of the two or more items alike in every respect except one 
and differing in that one by the required amounts. 

If the population of Texas is 5 million and that of Georgia 
3 million and if a man, representing Texas, is pictured beside 
a child, three fifths as tall, representing Georgia, the impression 
conveyed is entirely erroneous. The heights are in the ratio 
of 5:3, but the areas covered by the figures are approximately 
in the ratio of 25:9. However the situation is even worse than 
this for the weight of a man as pictured is to the weight of a 
child as pictured approximately as 125:27 and one is inclined, 
in so far as the pictures mean a man and a child, to make just 
such a comparison. 
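
The arithmetic of the distortion can be checked with a few lines of Python. This is merely a sketch; the function name `pictured_ratios` is hypothetical, and it assumes that the pictured figure is scaled alike in every dimension.

```python
from fractions import Fraction


def pictured_ratios(linear_ratio):
    """Ratios conveyed when a figure is enlarged alike in every dimension."""
    return {
        "height": linear_ratio,        # what the artist intended
        "area": linear_ratio ** 2,     # what the eye sees on the page
        "volume": linear_ratio ** 3,   # what the pictured bodies would weigh
    }


texas_to_georgia = pictured_ratios(Fraction(5, 3))
```

Here the intended ratio of 5:3 becomes 25:9 in area and 125:27 in volume, which is the exaggeration described in the text.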

If three dimensional objects are pictured upon a two dimen- 
sional surface to convey a one dimensional relation the objects 
should be identical in size and differ only in number. In the 
illustration mentioned, Texas could be represented by a row 
of five men and Georgia by a row of three. The use of men in 
picturing population, of sectors of a dollar in showing the items 
of a budget, of bales of cotton in picturing cotton production, 
etc., are conventional and expressive modes of presentation. 
Accuracy of presentation is favored by the use of rectangles of 
different lengths, but as independence of a heading may be 
accomplished by a proper choice of object for picturization, 
this method has certain indubitable advantages. However, 
if a two or three dimensional object is pictured either (a) all 
the dimensions except one should be kept constant and that 
one vary in the proportions desired, or (b) all dimensions should 
be the same and the number of objects vary. As an illustration
of (a) the amount of paving in two cities could be represented
by the pictured lengths of two roads, the amount of
coal produced by trains of gondola cars of different lengths, 
or the number of fish in the lakes at two resorts by angle worms 
of different lengths, etc. 

It is occasionally possible to represent not only the relative 
size of two categories but also their special temporal or spatial 
relation by graphic means. This is very prettily illustrated 
by the accompanying figure from Perry. (C. A. Perry, Educa- 
tional Extension. Quoted by Rugg, 1917.) 


CHART XV

Environment of a Minor

[Figure: bands labeled Home, School, Street and Amusement, and Shop or Office, plotted against age up to adulthood.]


A cross section at any age reveals the proportions of time 
spent in the various ways, but it does more than this, as it re- 
veals the temporal relations of these proportions. 

If one considers how many pages of writing matter would be 
required to convey an idea of all the relationships shown in 
Chart XV he will appreciate the art involved in graphic 
presentation. If he will likewise consider that a written 
presentation would probably be obscure and dreary reading 
and that the joy of discovery belongs to one who studies an 
ingenious chart, he will appreciate that the graphic method 
at its best has far greater advantages than those of simply 
saving space and time. 

The last figure conveyed information as to three different 
items, (a) age, (b) time spent in different activities, and (c) the 
temporal disposition with reference to each other of different 
activities. It is thus a complex series, being quantitative, 
qualitative and temporal. 




Accompanying is a block presentation of a complex series. 
It conveys information as to three different things, (a) date, 
(b) numbers of immigrants, and (c) country of birth. 


CHART XVI

IMMIGRATION
by five year periods
1891 to 1910
from U.S. Alien
Immigration Statistics

[Figure: block diagram showing numbers of immigrants by five-year period and country of birth.]

This information is fully presented in the figure, but it very 
frequently is impossible clearly to present a three-dimensional 
situation by a picturization of a three-dimensional figure, for 
commonly a part of the figure would obscure other essential 
parts. The large immigration from Germany in 1891-95 almost 
hides the block showing the immigration from Germany in 
1896-1900, but as it does not completely hide it the relationships
are readily apprehended. However, if the immigration
from Russia had also been larger in 1891-95 than in 1896-1900,
the block for the latter period would not have been
visible and the method would have been unsatisfactory.
Another device for presenting such data is given below: 


CHART XVII

IMMIGRATION IN THOUSANDS BY FIVE YEAR PERIODS
1891 TO 1910

[Figure: grid of cells, one for each country (Italy, Austria-Hungary, Russia, Germany) and each period (1891-1895, 1896-1900, 1901-1905, 1906-1910), the cross-hatching of each cell indicating the number of immigrants in thousands.]


This is a more flexible method than the preceding, as there is 
no possibility of one block covering up another, but it requires 
a coarse grouping in the measure represented by the cross- 
hatchings, or shadings, and in general its features are not out- 
standing as are those of the preceding figure. 

In the block figure the last three countries are in the order 
demanded by geographical position of the countries. An addi- 
tional fact, such as the numbers of literate and illiterate immi- 
grants, could be represented by shadings of appropriate areas 
upon the tops of the blocks. Still another, such as age, or sex, 
or vocation, could be shown by the color of the ink used in the 
cross-hatching. Even this does not exhaust the possibilities 
of graphic presentation upon a single two-dimensional surface. 




It is difficult to give a summary of the principles underlying 
graphic portrayal as they differ with the number of dimensions 
presented and with the continuous or discrete nature of the 
data, but the recommendations contained in the preliminary 
report of the joint committee on standards of graphic presenta- 
tion are of broad applicability. This committee represented a 
wide field of statistical workers and was formed upon the invitation
of the American Society of Mechanical Engineers. Its
recommendations as given by Haskell (1919) are: 

1. The general arrangement of a diagram should proceed 
from left to right. 

2. Where possible represent quantities by linear magnitude, 
as areas or volumes are more likely to be misinterpreted. 

3. For a curve the vertical scale, whenever practicable, should 
be so selected that the zero line will appear in the diagram. 

4. If the zero line of the vertical scale will not normally 
appear in the curve diagram, the zero line should be shown by 
the use of a horizontal break in the diagram. 

5. The zero lines of the scales for a curve should be sharply 
distinguished from the other coördinate lines.

6. For curves having a scale representing percentages, it 
is usually desirable to emphasize in some distinctive way the 
100% line or other line used as a basis of comparison. 

7. When the scale of the diagram refers to dates, and the 
period represented is not a complete unit, it is better not to 
emphasize the first and last ordinates, since such a diagram 
does not represent the beginning and end of time. 

8. When curves are drawn on logarithmic coördinates, the
limiting lines of the diagram should each be at some power of 
10 on the logarithmic scale. 

9. It is advisable not to show any more coördinate lines than
necessary to guide the eye in reading the diagram. 

10. The curve lines of a diagram should be sharply distinguished
from the ruling.

11. In curves representing a series of observations, it is
advisable, whenever possible, to indicate clearly on the diagram 
all points representing the separate observations. 

12. The horizontal scale for curves should usually read from 
left to right and the vertical scale from bottom to top. 




13. Figures for the scale of a diagram should be placed at 
the left and at the bottom or along the respective axes. 

14. It is often desirable to include in the diagram the numerical
data or formulae represented.

15. If numerical data are not included in the diagram it is 
desirable to give the data in tabular form accompanying the 
diagram. 

16. All lettering and all figures in a diagram should be placed
so as to be easily read from the base as the bottom, or from the 
right-hand edge of the diagram as the bottom. 

17. The title of a diagram should be made as clear and com- 
plete as possible. Sub-titles or descriptions should be added 
if necessary to insure clearness. 


PROBLEMS 


1. Smooth the temperature data by means of a moving average of three
class frequencies and plot. What is the modal value? 

2. Express the populations of California, Oregon and Washington as 
indexes with 1900 as base. Which state showed the greatest relative 
growth in the decade 1900-1910? 

3. Chart VI shows that relative to 1907 retail prices of steak in Chicago 
did not advance as fast as wholesale prices. Choosing each year in turn 
as base, determine the relative increase in the wholesale prices and the 
retail prices of steak for the succeeding year, and answer the question, 
“In how many years did retail price advances fail to keep pace with
wholesale price advances?” Using data in the last column of Table XII a
answer the same question with reference to Wholesale prices and Retail 
prices of 22 common articles. 

4. (a) Plot an Ogive curve for the raw data of Table XVIII and on the 
same paper (b) an Ogive curve for the same data as smoothed by a moving 
average of fifteen class frequencies. 

5. Plot hypothetical data giving incomes in Great Britain in the form of 
an Ogive curve. What is the mode? Fill out the following table: 


Incomes Received by Successive Percentiles 


PERCENTILES 1 5 10 20 25 30 40 50 60 75 80 90 95 99
INCOMES 


6. Save work for future reference. 


CHAPTER III 
THE MEASUREMENT OF CENTRAL TENDENCIES 


Section 10. AVERAGES 

A tabulation of the data pertaining to a distribution pre- 
sents all the facts, and a histogram or frequency polygon makes 
possible the visualization of this detail. Ordinarily, however, 
the detail is so great that it cannot be interpreted. In this 
case certain measures of the total distribution are serviceable 
in summarizing the data. The most important of these are 
averages, or measures of central tendency. The most significant
averages are (a) the mean [more accurately the arithmetic
mean], (b) the median, (c) the mode, (d) the geometric mean,
and (e) the harmonic mean. Note that these are all averages.
The word “average” is frequently used synonymously with
mean (arithmetic mean). It will occasionally be used in this
text in such expressions as “the average of the means,” in
order to avoid the more accurate but awkward expression
“the mean of the means.” Ordinarily “mean” will be used
consistently to designate the arithmetic mean, and “average”
as synonymous with “measure of central tendency,” thus
meaning any one of the five measures listed above.
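
The five averages may be compared on one set of data with the Python standard library, as in the sketch below. It assumes the ungrouped temperature frequencies used elsewhere in the text and takes the median and mode of the raw scores directly, which is a simpler procedure than reading them from an ogive curve.

```python
import statistics

# Temperature frequencies (degrees: number of days).
freq = {65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3,
        79: 1, 80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1,
        88: 2, 90: 1, 95: 1, 96: 1, 98: 2}

# Expand the frequency table into the 62 separate measures.
temps = [t for t, f in freq.items() for _ in range(f)]

averages = {
    "mean": statistics.mean(temps),                    # (a) arithmetic mean
    "median": statistics.median(temps),                # (b)
    "mode": statistics.mode(temps),                    # (c)
    "geometric mean": statistics.geometric_mean(temps),  # (d)
    "harmonic mean": statistics.harmonic_mean(temps),    # (e)
}
```

For positive measures such as these the harmonic mean cannot exceed the geometric mean, nor the geometric mean the arithmetic mean.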

The most important single item of information to be known 
about a distribution is what it is a distribution of. 

The second in importance is the number of cases in the distri- 
bution, or, as it is usually expressed, the population. 

The third, is to know some measure of central tendency, 
some average. 

The fourth, to know some measure of the degree to which 
the measures scatter, or lie above and below the average, i.e. 
to know a measure of dispersion or deviation from the average. 

The fifth, to know if the measures are symmetrically dis- 
tributed with reference to the average, or if there is a bunching 
of measures on one side of the average and a long tailing out 
of measures on the other side; i.e., to know a measure of
skewness.


The sixth, to know if the measures are exceptionally densely
grouped at the average, giving a high peak to the frequency
polygon (leptokurtic, i.e., β₂ of section 36 is greater than 3.0)
or if the distribution is rather flat in the middle and contracted
at the ends, thus tending toward a rectangular shape (platykurtic,
i.e., β₂ < 3.0); or if they show a mean between those
two conditions as does a normal distribution (mesokurtic,
β₂ = 3.0); in short, to know a measure of kurtosis.
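
As a rough illustration of the three cases, the following Python sketch computes β₂ and names the type of distribution. It assumes that β₂ is the usual ratio of the fourth moment about the mean to the square of the second moment; the definition used in this text is the one given in section 36. The function names and the sample data are hypothetical.

```python
def beta2(values):
    """Fourth moment about the mean divided by the squared second moment."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2


def kurtosis_type(b2, tolerance=1e-9):
    """Name the condition described in the text for a given beta-2."""
    if abs(b2 - 3.0) <= tolerance:
        return "mesokurtic"
    return "leptokurtic" if b2 > 3.0 else "platykurtic"


flat = [1, 2, 3, 4, 5]              # rectangular, hypothetical data
peaked = [0] * 8 + [-3, 3]          # dense at the average, hypothetical data
```

The rectangular sample gives a value below 3.0 and the peaked sample a value above it.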

These are all of the essential measures in the case of a uni- 
modal distribution; the next important item would be a 
measure of the tendency to have more than a single mode, or 
place of dense frequency. 

No treatment will be given in succeeding chapters of bi-modal 
curves, but if it is noted that uni-modal curves include anti-modal
or U-shaped curves, those having large frequencies
at the extremes and small frequencies in the middle, as well 
as L-shaped curves, rectangular distributions, and all forms 
of positive uni-modal curves, it will be seen that the great 
majority of distributions found in biology, economics, and 
psychology belong to the uni-modal type and that a knowledge 
of the six items mentioned above is adequate for all but a small 
number of distributions. 

Measures of skewness and kurtosis are essential in mathe- 
matically fitting curves to observations and are treated of in 
Chapter VIII on Curve Fitting. The calculation of averages 
is dealt with in this chapter and the relative excellence of the 
different averages will be considered in connection with their 
probable errors in the next chapter. 


Section 11. THE ARITHMETIC MEAN


The mean may be defined as the sum of the separate measures 
divided by the number of them. This definition immediately 
suggests the method of calculation: add the measures and 
divide by the population. If an adding machine is available 
and other measures of the distribution are not desired, this 
method is the most expeditious one to follow. Generally, 
however, it is more economical of time first to group the 
measures and arrange them according to magnitude, as was 
done with the Temperature data, Table VIII. Repeating
these data we have the first two columns of the accompanying
Table XXII. The third column illustrates one method of


TABLE XXII
Calculation of the Mean

X  = temperatures                          F  = grouped frequencies
f  = frequencies                           ζ  = deviations of groups from arbitrary origin
fX = products                              Fζ = products
ξ  = deviations from arbitrary origin
fξ = products

  X      f      fX       ξ      fξ       F      ζ      Fζ
 65      1      65     -15     -15
 66      1      66     -14     -14       2     -5     -10
 69                                      1     -4      -4
 70      1      70     -10     -10
 71      1      71      -9      -9
 72                                      1     -3      -3
 74      2     148      -6     -12
 75      3     225      -5     -15       6     -2     -12
 76      1      76      -4      -4
 77      1      77      -3      -3
 78      3     234      -2      -6       5     -1      -5
 79      1      79      -1      -1
 80     10     800       0       0
 81      8     648       1       8      23      0       0
 82      5     410       2      10
 83      7     581       3      21
 84      2     168       4       8      13      1      13
 85      4     340       5      20
 86      3     258       6      18
 87      1      87       7       7       6      2      12
 88      2     176       8      16
 90      1      90      10      10       1      3       3
 95      1      95      15      15
 96      1      96      16      16       2      5      10
 98      2     196      18      36
 99                                      2      6      12
        62   5,056             185      62             50
                               -89                    -34
                                96                     16

M = 5,056/62 = 81.548

Correction = 96/62 = 1.548              Correction = 3 × 16/62 = 3 × .258 = .774
Arbitrary Origin = 80.                  Arbitrary Origin = 81.00
M = 81.548                              M = 81.774


* Greek alphabet given in appendix. 




calculating the mean; the fourth and fifth columns a briefer 
method, in that it involves handling smaller numerical magni- 
tudes; and the last three columns another method which is 
still shorter in case the number of class intervals is large. For 
a method of calculating the mean, standard deviation, and 
higher moments by means of continued summations see Brown 
and Thomson (1921) and Elderton (1905). 

The headings of these columns are typical and will be 
repeatedly used in subsequent examples: 

X (or Y) will be used regularly, as here, to designate gross 
scores. 

f (F) designates class frequencies. 

ξ* (ζ) designates deviations of scores from an arbitrary
origin, or starting point. In column four, ξ represents deviations
of the gross scores from the arbitrary origin 80, while in
column seven ζ represents deviations of class intervals each
of which is three times as large as the class interval obtaining
in the gross scores, e.g., from 0-1 in column seven is one ζ unit
but it is three X units.

x (or y) has not been used in any of the above columns since 
it is reserved for a very definite purpose. It will consistently 
mean a deviation from the true mean. In the case in hand, 
if deviations from 81.548 had been recorded they would have 
been designated as x measures. Throughout the rest of this 
text x (or y) will mean a deviation from the mean or from an 
origin so near to the mean that no attention need be paid to the 
fact that it differs slightly from the true mean. 

N. One further symbol is universally employed: N (n)
stands for the population. In the present example N = 62.
[n occasionally has other meanings, particularly when it appears
as a subscript or a superscript.]

M is used to designate the mean. 

Σ. The symbol Σ indicates not a measure but an operation.
When placed before a symbol standing for a measure it indicates
that the sum of all such measures is to be obtained,
e.g., Σf means the sum of the frequencies; in the illustration
Σf = 62.

With these definitions in mind it will be seen that the mean
may be calculated according to any one of the following
formulas:

$$M = \frac{\Sigma X}{N} \qquad \text{(The mean)} \quad [1]$$
This formula is used in case measures are not grouped or 
arranged according to magnitude. 

$$M = \frac{\Sigma fX}{\Sigma f} \qquad \text{(The mean)} \quad [1\,a]$$
This is the method used in columns two and three. ΣfX = 5056
and Σf = 62. These two formulas are really identical, for
ΣfX simply means that each X is taken as many times as it
occurs. There is no mathematical operation in use in which
the sum of the measures is taken irrespective of the frequencies
in the various classes, so that in subsequent examples ΣX will
mean identically the same thing as ΣfX and will frequently
be written for the latter as it is more concise. For similar
reasons Σξ will be written for Σfξ; Σx for Σfx; Σx² for Σfx²; etc.


$$M = \text{Arbit. Orig.} + \frac{\Sigma \xi}{N} \qquad \text{(The mean)} \quad [1\,b]$$


This is the method illustrated in columns four and five. It is 
called the method of moments, i.e., of tendencies to produce 
rotation about a point. Moments may be taken about any 
origin and if the positive exceed the negative it means that 
the origin chosen is too small. Similarly if the negative exceed 
the positive moments the guessed mean, or arbitrary origin,
is too large and a negative correction is necessary. If the 
guessed mean is 80 and calculation shows that there are 96 
excess positive moments then, since there are 62 cases in all, 
the moment corresponding to each measure should be 
96/62 = 1.548 greater than it is in order to make the positive 
exactly equal the negative moments. This point where the 
moments exactly balance is the mean. Obviously if the guessed 
origin is moved by 1.548 units, i.e., if 1.548 be added to 80, a 
value will be determined such that if moments about it are 
taken the negative and positive moments will exactly balance. 
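
The balancing of moments about a guessed origin may be verified with a short Python sketch. It uses the temperature frequencies of Table XXII; the function names `net_moment` and `mean_from_origin` are hypothetical and serve only to illustrate the correction Σξ/N described above.

```python
# Temperature frequencies of Table XXII (X: f).
freq = {65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3,
        79: 1, 80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1,
        88: 2, 90: 1, 95: 1, 96: 1, 98: 2}


def net_moment(freq, origin):
    """Excess of positive over negative moments about the chosen origin."""
    return sum(f * (x - origin) for x, f in freq.items())


def mean_from_origin(freq, origin):
    """Guessed origin plus the correction: net moment divided by N."""
    n = sum(freq.values())
    return origin + net_moment(freq, origin) / n


n = sum(freq.values())
sum_fx = sum(f * x for x, f in freq.items())
mean_direct = sum_fx / n                      # products divided by N
mean_short = mean_from_origin(freq, 80)       # guessed origin of 80
```

Whatever origin is guessed, the corrected value is the same, and the net moment about the mean itself is zero (to within rounding).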


$$M = \text{Arbit. Orig.} + (\text{Class interval}) \frac{\Sigma \zeta}{N} \qquad \text{(The mean)} \quad [1\,c]$$


This method is illustrated in the last three columns. It is a 
moment method applied to data which have been grouped. 
The guessed origin is here 81, the class interval 3, i.e., 3 of the
gross measure units, and Σζ = 16. Solving, M = 81.774. The
discrepancy between this value and that obtained before is
due to the grouping, the true value being 81.548 and not
81.774. Such error may be either positive or negative, and,
unless very great precision is demanded, may be disregarded 
when the data show no pronounced periodic disturbances and 
when the number of class intervals is 12, or greater. (For 
considerations leading to the number 12 see section 46.) 
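
The grouped calculation can be sketched in Python as follows, again with the temperature frequencies of Table XXII. The function names `step_deviation` and `grouped_mean` are hypothetical; classes of three units are assumed to be centred on the guessed origin 81, so that 80 to 82 forms the zero class.

```python
import math

# Temperature frequencies of Table XXII (X: f).
freq = {65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3,
        79: 1, 80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1,
        88: 2, 90: 1, 95: 1, 96: 1, 98: 2}


def step_deviation(x, origin, width):
    """Deviation of the class containing x, counted in class intervals."""
    return math.floor((x - origin + width / 2) / width)


def grouped_mean(freq, origin, width):
    """Origin plus class interval times the mean step deviation."""
    n = sum(freq.values())
    total_steps = sum(f * step_deviation(x, origin, width)
                      for x, f in freq.items())
    return origin + width * total_steps / n


sum_steps = sum(f * step_deviation(x, 81, 3) for x, f in freq.items())
mean_grouped = grouped_mean(freq, 81, 3)
mean_true = sum(f * x for x, f in freq.items()) / sum(freq.values())
grouping_error = mean_grouped - mean_true
```

With these data the grouping error is about a quarter of a degree, the difference between 81.774 and 81.548.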

It will be noted that there are 11 class intervals in column ζ.
In the case of distributions which show peculiar local groupings 
great care should be exercised in combining class frequencies. 
In the case of the College Marks given in Table XVIII a 
combining of measures into groups as follows: 50.0-54.9,
55.0-59.9, 60.0-64.9, etc., and a designating of the middle
points of the groups as lying at 52.5, 57.5, 62.5, etc., would 
lead to substantial error in calculating the mean, since the 
measures in the groups are not all evenly distributed. To 
illustrate: if the 12 measures in the interval 65.0-69.9 are
grouped and assigned the value 67.5 an error of 1.33 has been 
introduced, for calculation shows that the true mean of these 
12 measures is 66.17. An error of 1.33 in a single group would 
not be serious, but for the College Marks data the error is 
typical of each group, so that a calculation of the mean from 
data so grouped would lead to systematic raising of the mean 
by an amount between 1 and 2 units. Whenever systematic 
local tendencies are apparent in data and grouping is resorted 
to, it should be endeavored to so group that the middle of each 
group interval corresponds to a local mode; e.g., with the 
College Marks the class intervals of the groups should be as 
follows: 47.5-52.5, 52.5-57.5, etc., since the mid-points of these
intervals, 50, 55, etc., correspond to local modes and also approxi- 
mately to the means of the measures in the group intervals. 
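
The effect of the choice of class limits may be illustrated with a small Python sketch. The twelve marks below are hypothetical, chosen only so that they lie in the interval 65.0-69.9, pile up at 65, and have a mean of 66.17 as in the text; the function name `grouped_mean` is likewise an assumption.

```python
import math

# Hypothetical marks: twelve measures between 65.0 and 69.9, mostly at 65.
marks = [65] * 8 + [68, 68, 69, 69]


def grouped_mean(values, lower, width):
    """Mean after replacing each value by the mid-point of its class.

    Classes run from lower + k*width up to, but not including,
    lower + (k + 1)*width.
    """
    mids = [lower + (math.floor((v - lower) / width) + 0.5) * width
            for v in values]
    return sum(mids) / len(mids)


true_mean = sum(marks) / len(marks)

# Classes 50.0-54.9, 55.0-59.9, ...: mid-points 52.5, 57.5, ..., 67.5.
error_a = grouped_mean(marks, 50.0, 5) - true_mean

# Classes 47.5-52.5, 52.5-57.5, ...: mid-points on the local modes 50, 55, ...
error_b = grouped_mean(marks, 47.5, 5) - true_mean
```

In this made-up case the first grouping raises the mean by 1.33 units, while the second, whose mid-points fall on the local modes, leaves a much smaller error.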

The data in Table XXIII reported by the New York State 
Industrial Commission and taken from the New York World 
of Jan. 27, 1919, are so grouped as to make it impossible 
accurately to determine any sort of an average wage. These 
data show that 6 per cent of women factory workers receive 
from $6-$7.99 a week, but certainly the mean wage of this 
group is not $7.00, for in all probability a large number receive
exactly $6.00, another large group exactly $7.00, while
lesser groups receive wages of $6.50 and $7.50, and but very 
occasionally would there be a wage such as $6.49 or $7.99. 
Since one end of the interval, $6.00, has a large frequency that 
is not balanced by the other end $7.99, the mean of the group 
may be expected to lie below $7.00, possibly considerably 
below. Similarly the 14 per cent receiving wages from $8.00 
to $9.99 presumably receive a mean wage much below $9.00. 
It is difficult to group data of this kind without introducing 
large error, but if the intervals had run, $6.25-$6.75, $6.76-$7.24,
$7.25-$7.75, etc., probably the mid-point of each group
would be close to the mean of the group. An attempt to deter- 
mine an average wage from the data as given might easily be 
nearly 50 cents in error. The unequal distances covered by 
successive intervals in the grouping proposed is a disadvantage 
which is more than compensated by having the mid-points and 
the means of the groups approximately coincide. 


TABLE XXIII
Full-time Earnings of 20,597 Women in Factories and 23,203 in Mercantile
Establishments

                          FACTORIES      STORES
                          PER CENT       PER CENT
Less than $6  . . . .         1             1
Less than $8  . . . .         7             7
Less than $10 . . . .        21            23
Less than $12 . . . .        42            44
Less than $14 . . . .        59            64
$14 or over . . . . .        41            36
[illegible] or over .        11             9

In any research the question usually arises whether to group 
at all, and, if so, what groupings to make. It has already been 
suggested that groupings should not be made which result in 
less than twelve classes. This is a lower limit. If the distribu- 
tion is pronouncedly asymmetrical, as for example is that 
showing incomes in Great Britain, twelve is far too small a 
number of classes. The lower end of that curve could not be 
at all satisfactorily represented if the income range covered 
by each interval should be as large as £100, nor with such 
grouping could the arithmetic mean be accurately determined.
A range of £40 for the lower intervals will answer, though a
range of £10 or £20 would be much better. Since incomes 
range from about £0 to £200,000 there would be no less than 
5000 classes needed to represent the distribution if the class 
interval is £40. 

The distribution of Wholesale Price Indexes is not as 
markedly asymmetrical, but it has such a phenomenal peak 
at “no change” that a coarse grouping cannot be used, or this 
characteristic is hidden. The plotted distribution has 41 
classes and 38 of them have frequencies other than zero. As 
plotted, the peak at “no change” is less pronounced than it is
in reality and if the grouping were coarser it would be still less 
apparent. A slightly coarser grouping would not have very 
great effect upon the mean, but it would have decided effect 
upon other constants, particularly those measuring kurtosis. 
Forty classes is close to the minimum which would be satis- 
factory for either graphic or numerical work with wholesale 
price index measures. 

For graphic presentation of College Marks a grouping into 
classes of five units each, with interval limits chosen as already 
indicated, would result in a graph nearly as satisfactory as 
that based upon the moving average involving five neighboring 
classes. Such a grouping leads to but 11 classes, which is too 
small for very reliable results. However, groupings into units 
of 4, 3, or 2 are not satisfactory, as they do not conform to the 
local periodicity, which is five units. A grouping into units 
of 2½ would be excellent from the standpoint of statistical
accuracy, but as it would involve splitting the frequencies in 
the gross score classes it would be uneconomical of time. All 
things considered it would seem advisable to use the gross 
score intervals, or, for rough work, a grouping of five gross 
score intervals. 

The situations presented by Incomes, Price Indexes, and 
College Marks are not typical, but illustrative of the more 
difficult grouping problems encountered. 

Consider the Temperature data, Table XXII, and note that 
if two gross score intervals had been grouped the frequencies 
in Column F would have been for intervals whose mid-points 
would be 65.5, 67.5, 69.5, etc., that when three gross score
intervals are combined the mid-points are, as shown, 66, 69,
72, etc.; and that in general if an even number of gross score 
intervals are combined the mid-points of the resulting intervals 
do not coincide with the mid-points of any of the original 
intervals but lie halfway between original measures. Ac- 
cordingly if an even number of gross score intervals are com- 
bined an entirely new table has to be made out. As this 
involves work and an additional chance for error it is undesirable 
if a grouping of an odd number of intervals will suffice. 
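
The point about odd and even groupings can be checked with a short sketch. The function name `grouped_midpoints` and its parameters are illustrative assumptions, not part of the text; it only supposes gross score classes of 1° beginning at 65°, as in the Temperature data.

```python
def grouped_midpoints(first_midpoint, step, k, n_groups):
    """Mid-points of the first n_groups classes formed by combining k
    neighbouring gross score intervals, each of width `step`."""
    mids = []
    for g in range(n_groups):
        first = first_midpoint + g * k * step   # mid-point of first member of the group
        last = first + (k - 1) * step           # mid-point of last member of the group
        mids.append((first + last) / 2)
    return mids


# Gross score classes 65, 66, 67, ... (1 degree wide), as in the Temperature data.
pairs = grouped_midpoints(65, 1, 2, 3)      # even grouping
triples = grouped_midpoints(65, 1, 3, 3)    # odd grouping
print(pairs)    # mid-points fall halfway between original measures
print(triples)  # mid-points coincide with original class mid-points
```

With an even grouping every mid-point ends in .5, so none of the original class labels can be reused; with an odd grouping each mid-point is one of the original labels.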

As a general rule, applying to distributions not especially 
asymmetrical (skew) nor peaked (leptokurtic), (1) an odd
number of gross score classes should be grouped, (2) the number 
of classes resulting from grouping should not be less than 12, 
and (3) the number of gross score intervals in a group should 
equal the number involved in local periods, or divide into such 
number without remainder, or be an integral multiple of such 
number. Finally in case the distribution is markedly skew or 
leptokurtic, conditions (1) and (3) remain the same but (2) 
the number of classes should be greater than 12 and great 
enough that significant portions of the distribution are revealed 
in such detail as is commensurate with their importance. 

In determining the number of gross score intervals to be 
grouped in ordinary data a serviceable rule to follow is to 
subtract the smallest from the largest measure and divide by 
twelve. The nearest odd integer below the resulting quotient 
is the proper number of gross score intervals to combine. E.g., 
in the case of maximum temperatures (98 - 65)/12 = 2.75.
The nearest odd integer below 2.75 is 1. Accordingly the 
data are not grouped at all and the gross score intervals of 1° 
kept as the proper steps. No material inaccuracy would have 
been introduced by combining two of the gross score intervals, 
but it would have been of questionable economy to do so. 

Applying the rule to the College Marks data we have,
(99 - 50)/12 = 4.1. The nearest odd integer below is 3. It
would therefore be appropriate to group three intervals were it
not for the fact that there is a local periodicity extending over
5 gross score intervals. Applying the rule to wholesale price indexes,
[103 - (-55)]/12 = 13.2. Since the original scores were
recorded in 2 per cent steps the interval of 13.2 per cent is
equivalent to 6.6 of the gross score intervals. The nearest
odd integer below 6.6 is 5, which would be the proper number
of gross score intervals to combine, were it not for the fact
that the data are very exceptional, having a phenomenal mode.
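
The rule of thumb just applied to the three sets of data may be sketched as follows. The function name `intervals_to_combine` and its parameters are assumptions made for illustration. "Nearest odd integer below" is read here as the largest odd integer not exceeding the quotient, with 1 as the floor; the text does not say how an exactly odd quotient should be treated.

```python
import math


def intervals_to_combine(smallest, largest, gross_step=1.0, min_classes=12):
    """Number of gross score intervals to group: (range / 12), expressed in
    gross score intervals, then reduced to an odd integer (never below 1)."""
    quotient = (largest - smallest) / min_classes / gross_step
    k = math.floor(quotient)
    if k % 2 == 0:          # an even count is stepped down to the odd one below it
        k -= 1
    return max(k, 1)


temperature = intervals_to_combine(65, 98)                # quotient 2.75
college = intervals_to_combine(50, 99)                    # quotient 4.1
price = intervals_to_combine(-55, 103, gross_step=2)      # 13.2 per cent = 6.6 intervals
print(temperature, college, price)
```

The sketch reproduces only the arithmetic. The exceptions noted in the text (local periodicity in College Marks, the phenomenal mode in price indexes) still have to be judged by inspection.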

The proper labeling of class intervals is important in con- 
nection with grouping. Class intervals of either grouped or 
ungrouped scores should be labeled by recording the lower and 
upper limits of the interval, e.g., 75.50-76.50, or by labeling 
the mid-point of the interval, e.g., 76.0. If the successive class 
intervals are the same the labeling of the mid-point is both 
clear and concise. A great deal of needless confusion is caused 
by improper labeling of intervals. The writer has found this 
especially true with reference to age data, such as the following: 


AGE   HEIGHT IN CM.      or again,      AGE   SCORE IN ARITHMETIC TEST
12        140                           12         18.324
13        150                           13         20.002
14        155                           14         20.980
15        160                           15         23.545


With data such as these it is a matter of sheer guess whether 
the scores correspond to mean ages of 12.0, 13.0, etc., or of 12.5, 
13.5, etc. If a single score is recorded for a class interval it
should universally be that of the mid-point of the interval, 
and in order to make it unambiguous the labeling figure should 
be carried one decimal further than the unit representing the 
class interval, e.g., if the above tables had read: 


AGE    HEIGHT IN CM.                    AGE    SCORE IN ARITHMETIC TEST
12.0       140                          12.5        18.324
13.0       150                          13.5        20.002
14.0       155                          14.5        20.980
15.0       160                          15.5        23.545


140 would have been taken as the mean height of individuals 
exactly twelve years old, etc., and no uncertainty would arise. 


Section 12. THE MEDIAN


The median of a series is the value of the mid-most measure, 
hence half the measures composing the series lie above it and 
half below. 

We will proceed to calculate the median of the daily maximum 
temperatures in New York City for July and August, 1917. 
The raw data are given in Table VIII. A hasty inspection 
shows that the lowest daily maximum temperature is 65° and 
the highest 98° and, a priori, knowing of no reason to expect 
that the distribution is skew it is assumed that the median 
lies about halfway between these two extremes. We will, 
therefore, make out a table of frequencies, as shown below: 


NUMBER OF DAYS HAVING TEMPERATURES NOTED

Temperature below 80   . . . . . . . =  15
     "      of    80   . . . . . . . =  10
     "      "     81   . . . . . . . =   8
     "      "     82   . . . . . . . =   5
     "      "     83   . . . . . . . =   7
     "      "     84   . . . . . . . =   2
     "      above 84   . . . . . . . =  15
                                       ----
                                        62


Adding up measures from both ends, it is found that the median 
measure lies in the group with temperature 81°; or, since there 
are 62 measures, it lies halfway between the values of the 
31st and 32d measures. As all measures from the 26th to the 
33d inclusive are recorded as 81°, the 31st and 32d are so 
recorded and 81° may be taken as a rough approximation to 
the median. However, it is not to be presumed that the 
maximum temperatures on all of the eight days for which 
the temperature of 81° has been recorded were exactly 81.0°. 
It is more reasonable to consider that the average of these 
8 temperatures was 81 and that they ranged all the way from 
80.5 to 81.5. Furthermore, since this interval is small with 
reference to the entire range of temperatures, 34°, we may 
with satisfactory warrant consider that these 8 measures are 
evenly distributed over the interval 80.5-81.5, as shown in 
the diagram below.

[Diagram: the 26th to 33d measures shown as eight equal segments of 0.125° each, laid end to end along the TEMPERATURES scale over the interval 80.5-81.5, with division points at 80.5, 80.625, 80.75, 80.875, 81.0, 81.125, 81.25, 81.375 and 81.5.]


It is immediately seen that the temperature midway between 
the 31st and 32d measures is 81.25°. This is therefore the 
median sought. 

This method is not the best possible, but gives a good 
determination for all practical purposes. For other methods 
see Bowley (1907). The best possible median is determined 
by mathematically fitting a curve to the observations and then 
integrating (or summing areas) from one end of the curve up 
to the point giving one half the total area. As thus determined 
the median is a function not only of position above or below 
a certain class value, but also of the distances of the measures 
above and below this median class, because the magnitude of 
each of the measures from the lowest to highest enters into the 
determination of the equation which fits the distribution. 

Following in principle this integrating method, a median 
may be determined mechanically from a carefully plotted 
frequency polygon by the use of a planimeter. A guess is 
made as to the median and a perpendicular erected. The 
planimeter is run around the boundary of the area thus cut 
off and the result noted. If the area recorded by the instru- 
ment is not exactly one half the total area an adjusted guess 
as to the median is made and the process repeated. This 
may be continued until the desired degree of accuracy is ob- 
tained. Continuing the preceding illustration: If 63 days 
had been considered, and if the temperature of the added day 
had been greater than 81° there would have been one measure, 
the 32d, which would have had just as many measures below 
it as above, and the temperature corresponding to the middle 
of this mid measure, 81.3125°, would be the median. The 
median, or mid measure, may therefore be defined as the value 
of the (N + 1)/2 measure, but as the value of a measure is 
the value of its mid-point, this is equivalent to saying that the 
median is the limit of the range covered by N/2 measures 


56 STATISTICAL- METHOD 


counted either down from the top or up from the bottom. The 
method pursued in the calculation of a median may be sum- 
marized and expressed in a formula as follows: 

1. Arrange the measures in order of magnitude and list the 
frequencies for each class interval, grouping such intervals as 
are well below, or well above the median interval. 

2. Let N = the total number of cases, i.e., the sum of the 
frequencies of all the classes. 

3. Determine the class in which the (N + 1)/2 measure 
lies. If it lies between two classes, as sometimes happens when 
N is even, the common boundary of these two classes is the 
median and no further calculation is necessary. (The infre- 
quent case when these two classes do not have a common 
boundary is treated in the next paragraph.) 

4. Let f = the frequency of this class. 

5. Let i = the class interval, or range covered by the median
class. 

6. Let F = the sum of the frequencies of all the classes below 
this class. 

Let F’ = the sum of the frequencies of all the classes above 
this class. 

7. Let v = the value of the lower boundary of this class. 

v’ = the value of the upper boundary of this class. 

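8. Let Mdn = the median value. Then

$$\mathrm{Mdn} = v + \frac{\frac{N}{2} - F}{f}\, i \qquad \text{(Median calculated from below up)} \quad [2]$$

$$\mathrm{Mdn} = v' - \frac{\frac{N}{2} - F'}{f}\, i \qquad \text{(Median calculated from above down)} \quad [2\,a]$$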


These two values of the median will be identical. 

Using the first of these formulas to calculate the median of 
the maximum temperatures we have the following: 
N = 62
f = 8
i = 1
F = 25 (frequencies below the median class)
v = 80.5 (lower boundary)

$$\mathrm{Mdn} = 80.5 + \frac{31 - 25}{8} \times 1 = 81.25$$

Or again, using the second of these formulas and calculating
from above down:
N, f, and i as above
F' = 29 (frequencies above the median class)
v' = 81.5 (upper boundary)

$$\mathrm{Mdn} = 81.5 - \frac{31 - 29}{8} \times 1 = 81.25$$


All cases have been covered by steps 1 to 8 except when the
median lies between two classes which do not have a common
boundary, as in the accompanying illustration:

FREQUENCIES     SCORES     CLASSES
     1             9          g
     3             8          f
[illegible]        7          e
     2             5          c
     2             4          b
[illegible]        3          a

Here the (N + 1)/2 measure lies between classes c and e, but the
upper limit of class c, 5.5, is not at the same time the lower
limit of class e, 6.5. The median value might be considered to
lie anywhere between 5.5 and 6.5, but the most reasonable
procedure is to call it the average of these two values.

The median is therefore (5.5 + 6.5)/2 = 6.0. With this under- 
standing every distribution yields a single value for the median. 
If this value has been calculated from the bottom up it is well 
to check by calculation from the top down. 
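
As a check on the arithmetic, formulas [2] and [2 a] may be written as two small functions. The function and argument names are illustrative assumptions; the figures are those of the temperature example above.

```python
def median_from_below(n, f, i, F, v):
    """Formula [2]: v + ((N/2 - F) / f) * i, counting up from the bottom."""
    return v + (n / 2 - F) / f * i


def median_from_above(n, f, i, F_above, v_upper):
    """Formula [2 a]: v' - ((N/2 - F') / f) * i, counting down from the top."""
    return v_upper - (n / 2 - F_above) / f * i


# Temperature data: N = 62, median class 81 degrees (80.5-81.5), f = 8, i = 1,
# 25 measures below the class and 29 above it.
below = median_from_below(62, 8, 1, 25, 80.5)
above = median_from_above(62, 8, 1, 29, 81.5)
print(below, above)
```

The two functions agree, as the text requires. The special case of a median falling between two classes with no common boundary is not handled here and must be settled by averaging the two boundaries as described above.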


Section 13. PERCENTILES 


The median is the value below which 50 per cent of the 
measures lie. It is, therefore, the 50-percentile. Similarly 
the 10-percentile is the value below which 10 per cent of the
measures lie, etc. The derivation which gave the formula 
for the calculation of the median may readily be generalized 
so as to provide a formula for the calculation of any percentile. 

Let N = the total population. 

Let P_p = the percentile, the value of which is to be calculated.

Let p = the proportion of cases having values smaller than
P_p. Thus P_p is the 100p-percentile. For example, if the
15-percentile is being considered, p = .15, and P_.15 is the
symbol standing for the value of the 15-percentile.

Determine the class in which the 100p-percentile, or the
(pN + ½) measure, lies.

Let f_p = the frequency in this class.

Let i_p = the interval or range covered by this class.

Let F_p = the sum of the frequencies in all the classes below
this class.

Let v_p = the value of the lower boundary of this class
interval.


$$P_p = v_p + \frac{pN - F_p}{f_p}\, i_p \qquad \text{(Value of a percentile, calculated from below up)} \quad [3]$$

This is the formula for the calculation of any percentile 
proceeding from small values of the variable to large values. 
If the calculation is from the other end of the distribution the 
formula is: 

$$P_p = v'_p - \frac{(1 - p)N - F'_p}{f_p}\, i_p \qquad \text{(Value of a percentile, calculated from above down)} \quad [3\,a]$$
in which,

v'_p = the value of the upper boundary of this class interval,

F'_p = the sum of the frequencies in all the classes above this class.
To insure accuracy it is well to calculate from below up and also 
from above down. 

The same procedure as in the case of the calculation of the 
median is to be followed in the case of a percentile lying some- 
where in a group with zero frequency. 

For sake of illustration this formula will be used to calculate 
(a) the 50-percentile (the median), (b) the 25-percentile (the
lower quartile), and (c) the 75-percentile (the upper quartile), 
for the temperature data. 

(a) The Median (Mdn)

N = 62
p = .50
(.50) 62 + ½ = 31½
The 31½ measure lies in the 81° class.

f_.50 = 8
i_.50 = 1
F_.50 = 25
v_.50 = 80.5

$$P_{.50} = 80.5 + \frac{(.50)\,62 - 25}{8} \times 1 = 81.25$$

Note that in calculating from the top down F'_.50 = 29, and
v'_.50 = 81.5.

(b) The lower quartile (L.Q.)

N = 62
p = .25
(.25) 62 + ½ = 16
The 16th measure lies in the 80° class.

f_.25 = 10
i_.25 = 1
F_.25 = 15
v_.25 = 79.5

$$P_{.25} = 79.5 + \frac{(.25)\,62 - 15}{10} \times 1 = 79.55$$

Note that in calculating from the other end F'_.25 = 37, and
v'_.25 = 80.5.

(c) The upper quartile (U.Q.)

N = 62
p = .75
(.75) 62 + ½ = 47
The 47th measure lies in the 84° class.

f_.75 = 2
i_.75 = 1
F_.75 = 45
v_.75 = 83.5

$$P_{.75} = 83.5 + \frac{(.75)\,62 - 45}{2} \times 1 = 84.25$$

In calculating from above down F'_.75 = 15, and v'_.75 = 84.5.

The difference between the two quartiles is the interquartile 
range and of necessity 50 per cent of the cases lie in this range. 
In the problem in hand the interquartile range is 4.7° and indi- 
cates that one half of the days studied had maximum tempera- 
tures within 4.7° of each other. 
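
The percentile formulas [3] and [3 a] can be sketched as a scan over the class list. This is a minimal illustration under stated assumptions: the names are hypothetical, and the two end groups are lumped as in the frequency table of Section 12, with outer boundaries of 64.5 and 98.5 assumed from the stated range of 65° to 98°. Percentiles falling inside a lumped end group would therefore be unreliable, and the zero-frequency case is not handled.

```python
# (lower boundary, upper boundary, frequency) for the temperature data.
# The outer boundaries 64.5 and 98.5 of the lumped end groups are assumptions.
CLASSES = [
    (64.5, 79.5, 15),
    (79.5, 80.5, 10),
    (80.5, 81.5, 8),
    (81.5, 82.5, 5),
    (82.5, 83.5, 7),
    (83.5, 84.5, 2),
    (84.5, 98.5, 15),
]


def percentile_from_below(classes, p):
    """Formula [3]: v_p + ((pN - F_p) / f_p) * i_p."""
    n = sum(f for _, _, f in classes)
    target = p * n
    cum = 0                      # F_p: frequencies below the current class
    for lower, upper, f in classes:
        if f > 0 and cum + f >= target:
            return lower + (target - cum) / f * (upper - lower)
        cum += f
    raise ValueError("p must lie between 0 and 1")


def percentile_from_above(classes, p):
    """Formula [3 a]: v'_p - (((1 - p)N - F'_p) / f_p) * i_p."""
    n = sum(f for _, _, f in classes)
    target = (1 - p) * n
    cum = 0                      # F'_p: frequencies above the current class
    for lower, upper, f in reversed(classes):
        if f > 0 and cum + f >= target:
            return upper - (target - cum) / f * (upper - lower)
        cum += f
    raise ValueError("p must lie between 0 and 1")


lower_q = percentile_from_below(CLASSES, 0.25)
median = percentile_from_below(CLASSES, 0.50)
upper_q = percentile_from_below(CLASSES, 0.75)
interquartile_range = upper_q - lower_q
print(lower_q, median, upper_q, interquartile_range)
```

Running both functions for the same p gives the check "from below up and also from above down" that the text recommends.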

The consideration of percentiles has been a diversion from 
the main purpose of this chapter, the study of averages, oc- 
casioned by their intimate connection with one of these aver- 
ages, but we will here take up the main problem again in the 
study of the mode. 


Section 14. THE MODE


The mode is the value in a series at which the greatest 
frequency lies, or it is the place of densest frequency. In the 
case of Price Indexes, Table XV, this greatest frequency lay 
at “no change” in price, which is accordingly the mode.

In the case of College Marks, Chart X, a pronounced mode 
at 90 is shown by the raw data. However, such data have
several modes and it is correct to speak of the distribution as 
multi-modal. If, from a priori consideration, it is thought 
that the minor modes are due to causes either chance or irrele- 
vant with reference to the main trend, it is desirable to smooth 
them out and determine the one mode. In the case of College 
Marks the minor modes at 85, 80, 75, etc., are not due to chance 
but to psychological causes lying in the minds of instructors 
when called upon to grade individuals upon a finer scale than 
parallels their competency to make judgments. These modes 
at 85, 80, etc., would not be expected to vanish if the popula- 
tion were increased many fold, but the minor modes in the 
temperature data, Chart II at 83°, 85°, 75°, etc., are probably 
due to chance and would disappear if records for a number 
of years were taken, but the mode at 80° would probably 
remain, though it might shift slightly one way or the other. 
If one is studying temperatures this latter mode only is signifi- 
cant. If one is studying the distribution of talent of pupils, 
the major mode only of the College Marks distribution is 
wanted, while if one is studying the psychology of pedagogues 
the minor modes are very significant. 

Assuming that the major mode only is sought we will consider 
its calculation. It is obvious that if the mode shown by the 
raw data is taken it will be very unreliable, for usually a change 
of but a measure or two will shift the mode, e.g., a shift of but 
a single measure in the temperature data from 80° to 81°
would make it indeterminate whether the mode was 80° or 81° 
while a shift of two measures from 80° to 83° would shift the 
mode 3°. For this reason the mode is always determined from 
smoothed data if the raw data show irregularities in the vicinity 
of the mode. 

The College Marks data have been smoothed by the moving
average method. (Sec. 6.) A perusal of Table XVIII shows
that an unquestioned mode is not established by the class 
frequencies given by a moving average involving three classes. 
In that case modes exist at 86, 89, 91 and 94 — the largest of 
these being that at 89. When five class frequencies are aver- 
aged, modes appear at 88, 90 and 91 — the largest being at 88, 
so that the mode is still undetermined. When fifteen frequencies
are averaged a single mode appears at 89, but the frequency
of the 89 class is only .13 larger than that of the 90
class, out of a population of 773, so that the reliability of the 
determination is obviously not very great. 

The distribution of frequencies given by averaging ten classes 
does establish the mode at 89.5 (the proof of this is left as an 
exercise) and accordingly 89.5 is the correct value to adopt as 
the mode. 

The moving average method of determining the mode may 
be summarized as follows: Calculate smoothed class frequen- 
cies in the neighborhood of the mode, by means of a moving 
average involving a small number of intervals. Repeat the 
process, averaging greater and greater numbers of intervals, 
until a major mode with no minor modes in close proximity 
appears. The smallest grouping by which this major mode 
is obtained, gives the best result. 
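
The procedure summarized above may be sketched as follows. Everything here is an illustrative assumption: the function names, the restriction to odd window sizes (the text also uses an even window of ten classes, which yields a mode between two classes), the reading of "no minor modes in close proximity" as "exactly one local maximum," and the frequency list, which is made up and is not the College Marks data.

```python
def moving_average(freqs, k):
    """Centred moving average over k (odd) neighbouring classes.
    The first and last k // 2 classes have no full window and are dropped."""
    half = k // 2
    return [sum(freqs[j - half:j + half + 1]) / k
            for j in range(half, len(freqs) - half)]


def local_modes(values):
    """Positions whose value strictly exceeds both neighbours."""
    return [j for j in range(1, len(values) - 1)
            if values[j - 1] < values[j] > values[j + 1]]


def smallest_single_mode_window(freqs, max_k=15):
    """Try windows of 3, 5, 7, ... classes and return (window size, class index
    of the mode) for the smallest window that leaves exactly one local mode."""
    for k in range(3, max_k + 1, 2):
        modes = local_modes(moving_average(freqs, k))
        if len(modes) == 1:
            return k, modes[0] + k // 2     # shift back to the original class index
    return None


# Hypothetical class frequencies with several minor modes.
HYPOTHETICAL_FREQS = [1, 3, 2, 6, 4, 9, 7, 10, 6, 8, 3, 4, 1]
raw_modes = local_modes(HYPOTHETICAL_FREQS)
result = smallest_single_mode_window(HYPOTHETICAL_FREQS)
print(raw_modes, result)
```

Stopping at the smallest window that gives a single mode follows the closing sentence of the summary: the smallest grouping by which the major mode is obtained gives the best result.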

Another method for determining the mode follows from the 
relationship between the mean, median, and mode. Pearson 
has shown (1895) that in the case of his Type III curves the 
following relation holds: 

Let Mo = mode, Mdn = median, M = mean, and σ = the
standard deviation of the distribution (σ defined in the next
chapter). Then

$$\mathrm{Mo} = M - \frac{M - \mathrm{Mdn}}{c} \qquad \text{(The mode)} \quad [4]$$

in which c is a magnitude differing slightly for different distributions
and closely given by the equation

$$c = .3309 - .0846\,\frac{(M - \mathrm{Mdn})^2}{\sigma^2} \qquad [4\,a]$$

Therefore, knowing the mean, median and standard deviation,
the mode may be calculated. Pearson’s Type III curve
is a skew curve limited at one end and unlimited at the other.
It is a very flexible curve and excellently represents a large
number of skew distributions. If by inspection, a curve seems 
to approach a finite limit at one end, to be unlimited at the 
other, and if its kurtosis (see Sections 10 and 36) is not extreme,
no serious error is likely to be introduced by assuming it to be 
a Type III curve. 

Since the mean and median can be very reliably determined, 
the mode derived from them is a very much more stable measure 
than that as determined in the last section. 

In case the distribution has a pronounced mode near the end 
at which it terminates, and a long and very thin tail at the 
other end, e.g., of the type of the distribution of incomes, it 
is well to use Formula [4 a], but for the great majority of 
skew distributions it is quite accurate enough to use c = .33. 
The mode is then given by equation: 


$$\mathrm{Mo} = M - 3.03\,(M - \mathrm{Mdn}) \qquad \text{(The mode)} \quad [4\,b]$$


Applying this method to the College Marks data for which 
89.5 has already been found to be the mode, as calculated by 
means of a moving average, we have, 


M = 86.495 (calculated by formula [1])
Mdn = 87.690 (calculated by formula [2])
M - Mdn = -1.195

Mo = 86.495 - 3.03 (-1.195) = 90.12


Of the two values obtained the greater credence should be 
given to 90.12. Using, instead of .33, the value of c as given 
by the full formula [4 a], leads to 90.13 as the mode; hence it is 
evident that the short formula is satisfactory for such a distri- 
bution as that of College Marks. 
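
The short formula is easily verified. In this sketch the function name and the default argument are assumptions; the relation implemented is formula [4] with c = .33, for which 1/c is approximately 3.03.

```python
def mode_from_mean_median(mean, median, c=0.33):
    """Formula [4]: Mo = M - (M - Mdn) / c, with c = .33 as the usual value."""
    return mean - (mean - median) / c


# College Marks: M = 86.495, Mdn = 87.690.
mode = mode_from_mean_median(86.495, 87.690)
print(round(mode, 2))
```

Because the mean lies below the median here, M - Mdn is negative and the mode comes out above both, as in the worked figures.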

In handling distributions so decidedly skew that the skewness 
approaches 1.0, in which case σ = 3(M - Mdn), neither of
the two formulas for calculating the mode from the mean 
and median can be used. 

The three methods given, (a) graphic method of Section 7, 
(b) by smoothing the data, and (c) by derivation from the 
mean and median, are merely make-shifts if the student is 
able to avail himself of the precise determination resulting 
from mathematically fitting a curve to the data. 


Section 15. THE HARMONIC MEAN


Dunn’s Wholesale Price Index is the cost of a year’s supplies 
of a certain type. If the mean of the twelve of these indexes 
for a given year is calculated, it gives the mean cost of that 
year’s supplies. But suppose instead of keeping the amount 
of goods constant and noting variability in price, the total cost 
had been kept constant and the variability in the amount of 
goods purchasable had been noted; how would one then pro- 
ceed to obtain the mean cost of a given amount of goods? 
The following table, adapted from data given in Bradstreet’s 
Journal, will serve to illustrate the problem: 


Ruling Wholesale Prices, November 1

                     1913    1914    1915    1916    1917    1918
POUNDS SUGAR
BOUGHT FOR $1        23.0    18.5    19.4    13.33   11.9    11.11    (Designated as X measures)


Let it be desired to determine the mean price of a pound of 
sugar for the six years. We will first build up a table giving 
the cost per pound at the successive dates, by taking the 
reciprocals of the X measures as follows: 


Ruling Wholesale Prices, November 1

                     1913    1914    1915    1916    1917    1918
COST OF SUGAR
IN DOLLARS           .0435   .0540   .0515   .0750   .0840   .0900    (Designated as 1/X measures)


The mean of these measures is .06633 which accordingly is 
the mean cost of a pound of sugar for the six years. It is to be 
noted that if the mean of the X measures is found, 16.266, and 
the reciprocal taken, .06148, the same value is not obtained. 
The magnitude .06148 is not the mean price per pound — it is 
the reciprocal of the arithmetic mean number of pounds bought 
for $1, and a difficult measure to interpret, though not meaning- 
less. The information of moment is the mean price per pound, 
or the reciprocal of this, the number of pounds which could be 
bought when paying the mean price per pound. This latter 
is the harmonic mean. In the case in hand it is the reciprocal
of .06633, or 15.08. Designating the harmonic mean by H.M.
and employing the usual notation it is defined by the equation:

$$\mathrm{H.M.} = \frac{1}{\dfrac{1}{N}\sum \dfrac{1}{X}} \qquad \text{(Harmonic mean)} \quad [5]$$

In words: The harmonic mean is equal to the reciprocal of
the mean of the reciprocals of the measures.

In deciding whether to use the arithmetic or the harmonic
mean one should first decide which is properly the magnitude
to remain constant (in the illustration, [a] the amount of sugar
bought, or [b] the amount of money spent). There is seldom
a doubt as to which should be the constant. If the data are
recorded in such a manner that this appropriate item is constant,
then the arithmetic mean is to be used. If the data, as recorded,
make this item the variable, then the harmonic mean should
be employed.

One further illustration may make this clearer. The following
scores were made in a three-minute test in addition:

[Table: X, the numbers of problems completed, set against f, the numbers of pupils making the scores designated, with the total number of pupils. The individual entries are illegible in this copy.]


The question should now be asked, Is the significant measure 
(a) the rate at which a pupil works a problem, or (b) the number
of problems that he can work in a given time? The writer 
would judge that the rate at which the pupil works, or the 
number of minutes required to work one problem, is the more 
straightforward, readily comprehended and generally mean- 
ingful measure. Accepting this and noting that the data as 
recorded make the time element constant and not the number 
of problems worked, the harmonic mean is seen to be the proper 
mean to use. 

If in this problem the arithmetic mean is calculated, there 
is a certain significance in it, but the reciprocal of this mean 
should not be compared with rate measures in which the number 
of problems is constant and the time allowed varies. 

For discussion of the properties of an index number based 
upon the harmonic mean, see Fisher (1921). 
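
The sugar illustration of this section can be reproduced in a few lines. The function name is an assumption. The sketch starts from the cost-per-pound row and derives the X measures from it, so that the results are not disturbed by the rounding of the printed X row.

```python
def harmonic_mean(xs):
    """Formula [5]: the reciprocal of the mean of the reciprocals."""
    return len(xs) / sum(1 / x for x in xs)


# Cost of a pound of sugar in dollars, 1913-1918 (the 1/X measures).
prices = [0.0435, 0.0540, 0.0515, 0.0750, 0.0840, 0.0900]
mean_price = sum(prices) / len(prices)        # mean cost of a pound

# Pounds bought for $1 (the X measures), recovered as reciprocals of the prices.
pounds_per_dollar = [1 / p for p in prices]
hm = harmonic_mean(pounds_per_dollar)         # pounds bought at the mean price

print(round(mean_price, 5), round(hm, 2))
```

The harmonic mean of the pounds-per-dollar figures is exactly the reciprocal of the mean price, which is the relation the text relies on.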


Section 16. GEOMETRIC MEAN


If the items in a series are so related (usually a temporal 
relationship) that the expression of each one in terms of the 
preceding one, i.e., relative to the preceding one, is the information
required, then the averages thus far treated do not serve
the purpose. These measures are, of course, ratios and the 
geometric mean is the significant average. 

In Table XII, column two, are given the costs, on January 1 
of successive years, of a year’s supplies of certain common 
products. If the cost for each year is expressed in terms of 
the cost the preceding year, we have the following Table: 


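TABLE XXIV

Dunn's Wholesale Price Index for each Year Expressed as a Relative to the
Preceding Year

1908 . . . . . . . . . . . . . .   1.0561
1909 . . . . . . . . . . . . . .    .9882
1910 . . . . . . . . . . . . . .   [illegible]
1911 . . . . . . . . . . . . . .   [illegible]
1912 . . . . . . . . . . . . . .   [illegible]
1913 . . . . . . . . . . . . . .   [illegible]
1914 . . . . . . . . . . . . . .   [illegible]
1915 . . . . . . . . . . . . . .   [illegible]
1916 . . . . . . . . . . . . . .   [illegible]
                                   -------
Sum  . . . . . . . . . . . . . .   9.2681
Mean (sum ÷ 9) . . . . . . . . .   1.02979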


If the mean advance per year is desired and the arithmetical 
mean, 1.02979, taken as the measure of it, serious error would 
be involved. The ratio of the basal year, 1907, with reference 
to itself is of course 1.00000, so that the mean advance as 
given by the arithmetic mean is .02979 and nine times this 
gives .2681, a measure for the advance over the entire period 
of nine years. That this is an incorrect measure is shown by 
the fact that the ratio of the prices in the last year to the 
basal year (137.666 ÷ 107.264) is 1.28343, showing that the
actual advance is .28343. The reason for this discrepancy is 
that each advance is figured upon the preceding year as a base 
and not as a proportion of the price in the basal year. Strictly 
speaking 1907 is basal for 1908 only; 1908 being basal for 1909, 
etc. Accordingly 1.0561 × $107.264 gives the price for 1908.
The price for 1908 times .9882 gives the price for 1909, or
.9882 × 1.0561 × $107.264; etc. Finally the product of all
the nine ratios, 1.28343, times $107.264 gives $137.666, the
price for 1916. In place of these nine different ratios whose
product gives the ratio of the last year to the basal year, may
be substituted a single mean ratio which, when taken nine
times as a factor, gives the same product. This is the geometric
mean and, designating it by G.M. and the ratios for the separate
years by p_1, p_2, p_3, ... p_n, it is defined by the equation:


$$\mathrm{G.M.} = \sqrt[n]{p_1 \times p_2 \times p_3 \times \cdots \times p_n} \qquad \text{(Geometric mean)} \quad [6]$$


It may be readily calculated by means of a Log Log slide rule 
or by means of logarithms as follows: 


$$\log \mathrm{G.M.} = \frac{\log p_1 + \log p_2 + \log p_3 + \cdots + \log p_n}{n} \qquad \text{(Geometric mean)} \quad [6\,a]$$


Using a slide rule the G. M. for the preceding data is found to 
be 1.0281. Using six place logarithms it is found to be 1.0282. 

A check on these values is possible by taking the 9th root of 
the ratio of the 1916 price to the 1907 price. By logarithms 
this is found to be 1.02811. This figure means that on the 
average, wholesale prices increased 2.81 per cent each year, 
from 1907 to 1916. 
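
The check by the ninth root, and the logarithmic form [6 a], may be sketched as follows. The function name is an assumption. Only the 1907 and 1916 prices and the arithmetic mean ratio quoted in the text are used, since the nine separate ratios are not all legible in this copy.

```python
import math


def geometric_mean(ratios):
    """Formula [6 a]: the antilog of the mean of the logarithms."""
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


# The product of the nine yearly ratios equals the ratio of the 1916 price
# to the 1907 price, so its ninth root is the geometric mean ratio.
overall_ratio = 137.666 / 107.264
mean_ratio = overall_ratio ** (1 / 9)

# Compounding the arithmetic mean ratio for nine years overshoots the true advance.
compounded_arithmetic = 1.02979 ** 9

print(round(mean_ratio, 5), compounded_arithmetic > overall_ratio)
```

Raising the geometric mean ratio to the ninth power recovers the overall ratio exactly, which the arithmetic mean ratio cannot do.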


The Index of Means, or of Sums 


Another problem arises in connection with indexes which 
may be illustrated by the wage data in the last three columns 
of Table XII. The essential portions are copied below:


Chicago
UNION WAGE PER HOUR

              Painters     Linotype Operators     Carpenters
1907 . . . .    50¢              50¢                56.3¢
1916 . . . .    70               50                 70

Same data expressed as ratios, 1907 as base

1907 . . . .   100              100                100
1916 . . . .   140              100                124.334


Let us suppose that there are the same numbers employed
from each of the unions, and let us designate this number by
N. The question that concerns us is how to determine the
average increase in wages. Does

$$\frac{1.40 + 1.00 + 1.24334}{3} = 1.2144,$$

indicating an increase of 21.44 per cent, give it?
Bearing this in mind let us approach it by another method.
The mean hourly wage in 1907 is

$$\frac{N \cdot 50 + N \cdot 50 + N \cdot 56.3}{3N} = 52.10 \text{ cents,}$$

and in 1916 it is

$$\frac{N \cdot 70 + N \cdot 50 + N \cdot 70}{3N} = 63.33 \text{ cents.}$$

Dividing 63.33 by 52.10 the ratio of the mean wage in
1916 to that in 1907 is found to be 1.2156, giving an increase of
21.56 per cent. The two values found are not identical and
it can be easily proven that in general they will not be, for,
letting P, L and C equal the initial wages in the three unions
respectively, and p, l, and c the ratios of the final wages to the
initial wages in the three cases; then pP, lL, and cC equal
the final wages respectively; and

$$\frac{P + L + C}{3} = \text{the mean initial wage;}$$

also,

$$\frac{pP + lL + cC}{3} = \text{the mean final wage;}$$

and the ratio of these two wages is

$$\frac{pP + lL + cC}{P + L + C}.$$

This is identical with

$$\frac{p + l + c}{3}$$

only in case P = L = C, which
in general is not the case. The fact that the initial wages 
were so nearly equal in the illustration accounts for the small 
difference in the two results. 

We may therefore conclude that it is inaccurate to take the 
mean of ratios as equivalent to the ratio of the means (or sums) 
of final and initial scores. 
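
The comparison just made can be repeated mechanically. The following is a minimal sketch in Python; the variable names are ours and are not part of the text, and the wages are those of the Chicago table above.

```python
# Hourly union wages in cents (Chicago), as given in the table above.
wages_1907 = {"painters": 50.0, "linotype operators": 50.0, "carpenters": 56.3}
wages_1916 = {"painters": 70.0, "linotype operators": 50.0, "carpenters": 70.0}

# Mean of the three separate ratios (1916 wage divided by 1907 wage).
ratios = [wages_1916[trade] / wages_1907[trade] for trade in wages_1907]
mean_of_ratios = sum(ratios) / len(ratios)

# Ratio of the mean wages; the common number N in each union cancels,
# so the ratio of the sums serves equally well.
ratio_of_means = sum(wages_1916.values()) / sum(wages_1907.values())

print(round(mean_of_ratios, 4))   # 1.2144
print(round(ratio_of_means, 4))   # 1.2156
```

The two printed figures differ in the third decimal place, which is the discrepancy discussed above.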


Section 17. WEIGHTING 


If the numbers of workers in the three trades had been the 
same throughout and if because of considerations other than 
population the trades possessed importances W, w, w, then it 
would have been proper to multiply the wages by amounts 
equal or proportionate to W, w, w. This is “weighting.” 




The multiplying of a score by the number of cases having it 
has at times been called weighting, but in this text the term 
will be used to mean the multiplying of scores by amounts 
determined not at all, or not solely, by the population, but 
from other evidences of importance. (See Section 91.)

It is generally a difficult problem to determine just what con- 
stitutes proper weighting. When one is confronted with the 
problem of weighting measures which are to be combined and 
feels incompetent to judge their relative importances accurately,
he is inclined to “solve” the problem by “not weighting at
all.” But the failure to assign weights is actually a very definite
weighting — that of calling the units involved in the various 
measures of equal importance. This is not the same as saying 
that the failure to assign weights results in giving equal impor- 
tance to the different items. This latter is not the case if the 
dispersions of the scores for the various items differ. This 
point, together with others involved in weighting, is treated at 
length in connection with partial correlation. It may cer- 
tainly be said that, judging by the ordinary run of studies in 
economics and psychology, much more error has been committed
by “not weighting at all” than by improper weighting.


PROBLEMS 


1. Calculate the mode for the maximum temperature data of Table 
VIII. Is the short formula, in which c = .33, appropriate to use in this
case? 


2. Calculate the L. Q., Mdn. and U. Q. for the hypothetical distribution 
of incomes, comparing with graphic determinations (Problem 5, Chapter 
II).


Calculate the mean. Assume that the mean income for the highest 
income group is £21,000. Since these data have very irregular class in- 
tervals, in calculating the mean, great care must be taken in assigning 
ξ values to the different classes, no matter where the arbitrary origin is
chosen. For this reason it will be more accurate and almost as short if 
the method given by Formula [1] is followed. The student may well 
make the calculation both ways to become familiar with the handling of 
irregularly grouped data. 


Calculate the mode: (a) by finding the point of inflection in a smoothed 
ogive curve, (b) by deriving from the values of the mean and median, 
using c = .33 and (c) the same, using the full formula for c. In doing this
take σ = ¾ the interquartile range.


3. The following three series are scores of individuals in three tests.
They may be used as practice series for the calculation of M, Mdn., L. Q.,
and of constants treated of in subsequent chapters.


Practice Series

Individual    Series 1    Series 2    Series 3
A             151         132         148
B             147         132         143
C             145         130         153
D             138         128         148
E             134         121         135
F             124         103         134
G             120         105         138
H             118         122         138
I             116          99         128
J             114         124         129
K             113         109         131
L             107          99         136
M             106         103         124
N             105          98         126
O             104         108         133
P             101         104         122
Q             100         115         137
R              99         111         119
S              98         107         121
T              96          92         124
U              89          96         118
V              87          94         126


4. Calculate the 5th, 10th, 15th, etc., percentiles for the scores in handwriting
upon the Ayres and Thorndike scales, given in Table XXX,
Section 34, and check answers against columns 1 and 2, Table XXXII, 
Section 35. 

Group the Ayres data in 3’s and the Thorndike data in 5’s, calculate the 
same percentiles and check against answers in columns 3 and 4 of Table 
XXXII. 


CHAPTER IV 
MEASURES OF DISPERSION 


Section 18. THE MEAN DEVIATION


Distributions having the same average may differ markedly 
in the spread of the measures composing them. The following 
two series of measures have the same mean, median and mode, 
but the scatter of the measures is very different: 

Geuis te dve con) One O19 g es) eer Oe eae 

Uy Ge By) oe Beh dy shih nice Ad 
The range in the first series is three, while in the second it is 
fifteen. If deviations from the mean, 8, are calculated, they 
run: 

=— Tpeoaly — I; 0, Ch eR ee to be ae i Site & 

a7 hee Tine 75), — 44 — Sy. 10)" Se ATs Zen 10 de eo 
The means of these two series of deviations are of course zero if 
taken algebraically, but if taken absolutely, i.e., irrespective of 
sign, they are .545 and 6.0 respectively. These are the mean 
deviations. 

The mean deviation may be defined as the sum of the abso- 
lute values of the deviations of the separate measures from 
the mean, divided by the population. 

It can be calculated by the method of moments. Referring 
to Table XXII, columns four and five: If the deviations had 
been from the mean, 81.548 (in which case they would have 
been designated by x instead of by ξ) instead of from 80, a
mere guess, the products f·x would have been slightly different
from those recorded in column f·ξ, and their sum, irrespective
of sign, divided by their number, 62, would have been the
mean deviation. Since, however, the calculation of deviations 
from the mean, 81.548, involves fractional or decimal magni- 
tudes it is in practice inconvenient to determine the mean 



deviation in this manner. Deviations from 81.548 run as 
given herewith in line (x): 


(x): −16.548 −15.548 −14.548 ... −2.548 −1.548 −.548 .452 1.452 ... 16.452
(ξ): −15     −14     −13     ... −1     0      1     2    3     ... 18

For purposes of comparison, the corresponding deviations 
from the arbitrary origin, 80, are given in line (ξ). It is seen
that algebraically each ξ measure is 1.548 larger than the
corresponding x measure. In absolute value all the ξ deviations
up to and including those for class 80°, 25 in number,
are 1.548 too small; those in class 81°, 8 in number, are .452 
too large; and those in classes 82° and on, 29 in number, are 
1.548 too large. Tabulated, the data show: 


25 measures 1.548 too small 
29 measures 1.548 too large 


Excess of 4 measures 1.548 too large = excess positive moment of 
4 X 1.548 = 6.192 
Excess of 8 measures .452 too large = excess positive moment of 
8 X .452 = 3.616 
Total excess positive moment = 9.808 


The sum of the moments as calculated from 80° is 89 + 185 = 
274, but this is too large by 9.808. Accordingly the sum of 
the deviations from the mean is 264.192 which, divided by 62, 
gives 4.26, the mean deviation sought. 
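
The result may be checked by the direct, if laborious, route of taking every deviation from the mean itself. The sketch below is in Python and is offered only as a check; the frequencies are those we read from Table XXV (temperature data, arbitrary origin 80), and the variable names are ours.

```python
# Frequencies of the maximum-temperature scores, as read from Table XXV.
frequencies = {
    65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3, 79: 1,
    80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1, 88: 2,
    90: 1, 95: 1, 96: 1, 98: 2,
}

population = sum(frequencies.values())
mean = sum(score * f for score, f in frequencies.items()) / population

# Mean deviation: sum of absolute deviations from the mean, divided by N.
mean_deviation = (
    sum(f * abs(score - mean) for score, f in frequencies.items()) / population
)

print(population)                 # 62
print(round(mean, 3))             # 81.548
print(round(mean_deviation, 2))   # 4.26
```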

The calculation, as shown, is cumbersome. A simple 
formula for the calculation of the mean deviation from the 
first moment about zero as an arbitrary origin is herewith 
derived. 

Given the series 11, 12, 13, 13, 16. Mean = 13.0. The 
deviations of the successive measures from the mean are, 
— 2, —1, 0, 0, 3 respectively, giving a mean deviation of 1.2. 
These deviations are (11-13), (12-13), (13-13), (13-13), (16-13), 
but since all are to be taken positively they must be written, 
(13-11), (13-12), (13-13), (13-13), (16-13). Using the usual 
notation we have: 

$$\text{A.D.} = \frac{(M - X_1) + (M - X_2) + (X_3 - M) + (X_4 - M) + (X_5 - M)}{N}$$

$$= \frac{-X_1 - X_2 + X_3 + X_4 + X_5 + M + M - M - M - M}{N}$$




If F = the number of measures lying below the mean (here 2),
then it is seen that M enters in positively F times and negatively
(N − F) times and that the X's which are smaller than the
mean enter in negatively (the sum of these may be represented
by $\sum_{1}^{F} X$) and that those greater than the mean enter in positively
(this sum may be represented by $\sum_{F+1}^{N} X$). Accordingly we
have:

$$\text{A.D.} = \frac{\sum_{F+1}^{N} X - \sum_{1}^{F} X + FM - (N - F)M}{N}$$   [7]


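Since, however,

$$\sum_{F+1}^{N} X - \sum_{1}^{F} X = \sum_{F+1}^{N} X + \sum_{1}^{F} X - 2\sum_{1}^{F} X = \sum_{1}^{N} X - 2\sum_{1}^{F} X$$

and since,

$$\sum_{1}^{N} X = NM$$

the formula becomes

$$\text{A.D.} = \frac{2}{N}\Bigl(FM - \sum_{1}^{F} X\Bigr)$$   (Average deviation from the mean) [7 a]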


This is a very simple formula to use in connection with an 
adding machine. If the entries are not arranged according to 
magnitude add them on the machine and determine the mean, 
at the same time determining the population, N. Then add 
all the measures which are smaller than M, thus obtaining
$\sum_{1}^{F} X$, at the same time determining the number of such measures,
F. Thus two listings on an adding machine will yield the
three important constants N, M and A.D.
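
The two "listings" described may be imitated in a few lines of Python. This is a sketch only; the function name is ours, and the routine simply applies formula [7 a] and compares the result with the definition of the mean deviation.

```python
def average_deviation(measures):
    """Average deviation from the mean by formula [7 a]: (2/N)(F*M - sum of X below M)."""
    n = len(measures)
    mean = sum(measures) / n                        # first listing: N and M
    below = [x for x in measures if x < mean]       # second listing: measures below M
    f = len(below)
    return 2 / n * (f * mean - sum(below))


series = [11, 12, 13, 13, 16]
by_formula = average_deviation(series)

# Direct calculation from the definition, for comparison.
mean = sum(series) / len(series)
by_definition = sum(abs(x - mean) for x in series) / len(series)

print(round(by_formula, 4), round(by_definition, 4))   # 1.2 1.2
```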

If the measures are arranged according to magnitude a 
single listing will suffice, it only being necessary to take sub- 
totals for each of the group frequencies in the neighborhood 
of the mean. For example the adding machine listing for the 
preceding series would be as shown herewith: 


* This formula, with empirical proof, was independently discovered by two of the 
writer’s students, Miss Elva Wald and Mr. John P. Herring. 




         11
         12
2        23   (sub-total)
         13
         13
4        49   (sub-total)
         16
5        65   (total)


One would guess that the mean lay somewhere between 12
and 14 and would therefore take sub-totals after listing 12
and again after listing the 13's. Having N = 5 and the sum =
65, division gives the mean, 13.0. The listing shows that
there are two measures below the mean and that their sum is
23, i.e., F = 2 and $\sum_{1}^{F} X = 23$. Thus immediately

$$\text{A.D.} = \frac{2}{5}(2 \times 13 - 23) = 1.2$$

The peculiar expedition of this formula should make it service- 
able in large studies where time of computation is an important 
factor. It will shortly be shown that the probable error of 
the average deviation is but slightly greater than that of the 
standard deviation, so that unless the greatest accuracy is 
demanded, and unless the standard deviation is needed for 
such further purposes as use in correlation formulas, the aver- 
age deviation will be found advantageous. 

Returning to the Wald-Herring formula [7] it may be noted 
that if deviations around some point, P, other than the mean, 
be taken, and if F = the number of measures lying below this 
point, the formula becomes: 


$$\text{A.D. around pt. } P = \frac{1}{N}\Bigl[\sum_{F+1}^{N} X - \sum_{1}^{F} X + (2F - N)P\Bigr]$$   (Average deviation around any point P) [8]


If F = N/2 then P is the median and the formula becomes:

$$\text{A.D. around Mdn} = \frac{1}{N}\Bigl[\sum_{\frac{N}{2}+1}^{N} X - \sum_{1}^{\frac{N}{2}} X\Bigr]$$   (Average deviation from the median) [9]

Note that if N is odd, N/2 and (N/2 + 1) are fractional. In this
case it is necessary to add one half of the median measure in
each summation. For the series 11, 12, 13, 13, 16;

$$\text{A.D. around Mdn} = \tfrac{1}{5}\,[(6.5 + 13 + 16) - (11 + 12 + 6.5)] = 1.20$$


This is the same as the average deviation from the mean for,
in this particular problem, if measures are taken at their face 
value, the median and the mean coincide. Such measures as 
usually occur may, with insignificant error, regularly be taken 
at their face value in calculating the average deviation from 
the median, but they should not be so taken in calculating the 
median itself. The method already given in Section 12, based 
upon the assumption that the measures spread themselves 
evenly over the interval, is to be followed in calculating the 
median. 
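
Formula [8] lends itself to the same mechanical treatment. The Python sketch below is illustrative only; the function name is ours, measures are taken at their face value, and a measure coinciding with the point P is counted with the upper group, where it contributes nothing to the result.

```python
def average_deviation_about(measures, point):
    """Average deviation around any point P, following formula [8]."""
    n = len(measures)
    below = [x for x in measures if x < point]
    not_below = [x for x in measures if x >= point]   # ties with P add zero
    f = len(below)
    return (sum(not_below) - sum(below) + (2 * f - n) * point) / n


series = [11, 12, 13, 13, 16]
about_median = average_deviation_about(series, 13)     # the median of the series
about_other = average_deviation_about(series, 12.5)    # an arbitrary point

print(round(about_median, 4))   # 1.2
print(round(about_other, 4))    # 1.3
```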

The mean deviation, unless stipulated to the contrary, is 
always calculated from the mean. It is at times desirable to 
calculate it from the median, in which case it should be definitely
labeled “mean deviation from the median.” A real
reason for calculating it from the median exists in the fact 
that when so calculated it is smaller than when calculated 
from any other point, as can readily be shown: 

Let ζ = a deviation from the median. Then the

$$\text{M. dev. from the Mdn} = \frac{\sum|\zeta|}{n}$$

Let ξ = a deviation from a point P which is Δ distance
from the median; Δ < one class interval. Then ξ = ζ + Δ.

$$\text{M. dev. from } P = \frac{\sum|\xi|}{n} = \frac{\sum|\zeta| + F\Delta - (n - F)\Delta}{n}$$

Suppose Δ is positive, then P lies above the median and F >
(n − F) so that the above right hand member = $\frac{\sum|\zeta|}{n}$ + a
positive magnitude. If Δ is negative, P lies below the median
and F < (n − F), so that the right hand member still = $\frac{\sum|\zeta|}{n}$ +
a positive magnitude. Therefore, whether point P lies above
or below the median the mean deviation from it is greater than
$\frac{\sum|\zeta|}{n}$, the mean deviation from the median. The proving of




this same relation when Δ > one class interval can be readily
accomplished and is left as an exercise. Accordingly the mean 
deviation is a minimum when taken from the median. 
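
The minimum property may be illustrated numerically. The following Python sketch uses a short hypothetical series of our own (not taken from the text), chosen so that the mean and median differ, and searches a grid of candidate points for the one giving the smallest mean deviation.

```python
from statistics import mean, median


def mean_deviation_about(measures, point):
    """Mean of the absolute deviations of the measures from the given point."""
    return sum(abs(x - point) for x in measures) / len(measures)


scores = [2, 3, 5, 9, 21]                        # hypothetical series; Mdn = 5, M = 8
candidates = [k / 10 for k in range(0, 251)]     # points 0.0, 0.1, ..., 25.0

best_point = min(candidates, key=lambda p: mean_deviation_about(scores, p))
about_median = mean_deviation_about(scores, median(scores))
about_mean = mean_deviation_about(scores, mean(scores))

print(best_point)      # 5.0, the median
print(about_median)    # 5.0
print(about_mean)      # 5.6
```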


Section 19. THE QUARTILE DEVIATION 


A measure of dispersion may be obtained by taking the 
difference between any two percentiles. One such measure, 
the difference between the upper and lower quartiles, or the 
interquartile range, has already been mentioned. The most 
customary measure, however, is one half this measure, the 
semi-interquartile range, which for convenience and brevity 
is called the quartile deviation, and is designated by “Q.”
Using the usual notation for the upper and lower quartiles,
we have:

$$Q = \frac{U.Q. - L.Q.}{2}$$   (Quartile Deviation) [10]


It is to be noted that the quartile deviation is not a deviation 
from any of the averages thus far considered. It is simply a 
measure indicative of dispersion. If thought of as a deviation 
at all it should be as one from a point midway between the 
upper and lower quartiles. A rather better way to interpret 
it is as one half the interquartile range, a range within which 
lie 50 per cent of the measures. 


Section 20. THE 10-90 PERCENTILE RANGE 


A range somewhat larger than the interquartile range has 
advantages over it and the quartile measure derived from it, 
as a measure of variability. I have shown (Kelley 1921 new) 
that for a normal distribution the interpercentile range having 
the minimal error is that between the 6.917 and the 93.083 
percentiles. A range but slightly different from this and 
having nearly as great reliability is that between the 10th and
90th percentiles. This distance is called D and is given as
the most serviceable measure of dispersion based upon percentiles.

$$D = P_{.90} - P_{.10}$$   (10-90 percentile range) [11]
Its calculation and interpretation are very simple, and as over 
72 per cent more cases are required to secure as great reliability 




in the quartile deviation, this measure of dispersion is recom- 
mended wherever percentiles are used. Its relationship, in 
case of a normal distribution, with other measures of dispersion 
is given in Section 31. For proof of the next ten formulas the 
reader is referred to the reference cited. 

The standard error of D is given by formula [16] which in 
turn depends upon formulas [40], [43] and the following: 


$$r_{P_p P_{p'}} = \sqrt{\frac{p\,q'}{q\,p'}}$$ in which p < p'   (The correlation between any two percentiles P_p and P_p') [12]

$$\sigma^2_{P_p - P_{p'}} = \frac{Npq}{(y)^2} + \frac{Np'q'}{(y')^2} - \frac{2Npq'}{yy'}$$   (The standard error of an interpercentile range) [13]

in which p < p' and y is the ordinate of the curve at the percentile
P_p, and similarly for y' and P_p'.
Assuming normality, formula [13] becomes 


$$\sigma_{P_p - P_{p'}} = \frac{\sigma}{\sqrt{N}}\sqrt{\frac{pq}{(z)^2} + \frac{p'q'}{(z')^2} - \frac{2pq'}{zz'}}$$   (Standard error of an interpercentile range in a normal distribution) [14]

in which z and z' are ordinates as given in Table K-W for
arguments of q and q'. If, further, percentiles equally distant
from the ends of the distribution are calculated, p = 1 − p'
and formula [14] becomes


$$\sigma_{P_p - P_{p'}} = \frac{\sigma}{z\sqrt{N}}\sqrt{2p(q - p)}$$   (Standard error of a symmetrical interpercentile range in a normal distribution) [15]


We now obtain for the standard error of the 10-90 interper- 
centile range 


$$\sigma_D = 2.279224\,\frac{\sigma}{\sqrt{N}}$$   [16]

Entering Table K-W with q = .1 we find that x = 1.281552.
Thus D = 2.563104 σ which gives


$$\text{P.E.}_D = .5998\,\frac{D}{\sqrt{N}}$$ *   (Probable error of D) [16 a]

This is a very convenient formula, as, for ordinary purposes, we
may take

$$\text{P.E.}_D = .6\,\frac{D}{\sqrt{N}}$$   [16 b]


*On p. 744 of the reference cited (Kelley 1921 new) this value is incorrectly given as
.6001
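
The constants entering the standard and probable errors of D can be recomputed from the normal curve. The Python sketch below is a check under the assumption of normality; it uses the standard library's `NormalDist` in place of Table K-W, and the variable names are ours.

```python
from math import sqrt
from statistics import NormalDist

unit_normal = NormalDist()            # mean 0, standard deviation 1

x = unit_normal.inv_cdf(0.90)         # deviate cutting off the upper tenth
z = unit_normal.pdf(x)                # ordinate at that deviate (same at the 10th percentile)

# Formula [14] with p = .10, q = .90, p' = .90, q' = .10 and z = z'.
p, q, p_prime, q_prime = 0.10, 0.90, 0.90, 0.10
variance_coefficient = p * q / z**2 + p_prime * q_prime / z**2 - 2 * p * q_prime / (z * z)

se_in_sigma_units = sqrt(variance_coefficient)   # multiplies sigma / sqrt(N)
d_in_sigma_units = 2 * x                         # D expressed in sigmas
pe_in_d_units = 0.6744898 * se_in_sigma_units / d_in_sigma_units   # multiplies D / sqrt(N)

print(round(x, 6))                   # 1.281552
print(round(se_in_sigma_units, 4))   # 2.2792
print(round(pe_in_d_units, 4))       # 0.5998
```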




Two other constants which are of value in determining the 
type of a curve are Sk and Ku defined by the following equa- 
tions: 

$$Sk = P_{.50} - \tfrac{1}{2}(P_{.10} + P_{.90})$$   (A measure of skewness based on percentiles) [17]

The standard error of Sk is

$$\sigma_{Sk} = .5185\,\frac{D}{\sqrt{N}}$$   (The standard error of the percentile measure of skewness) [18]

$$Ku = \frac{Q}{D}$$   (A measure of kurtosis based on percentiles) [19]

The standard error is

$$\sigma_{Ku} = \frac{.27779}{\sqrt{N}}$$   (The standard error of the percentile measure of kurtosis) [20]

For a symmetrical distribution Sk = 0 and for a mesokurtic
distribution Ku = .26315. If a given distribution has a
Ku > .26315 it is platykurtic and if < .26315 it is leptokurtic.

We thus see that the percentiles of a distribution may be 
used to answer some of the important questions of curve type. 
If populations are large, so that standard errors are small, 
resort to the longer though generally more accurate (not always, 
as it is dependent on curve type) methods of Chapter VII 
may frequently be avoided. 
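
As an illustration of the percentile constants, the Python sketch below computes Q, D, Sk and Ku from five percentiles. The function name and the particular normal distribution (mean 100, standard deviation 15) are our own assumptions, chosen only to show that a normal curve yields Sk = 0 and Ku = .26315.

```python
from statistics import NormalDist


def percentile_constants(p10, p25, p50, p75, p90):
    """Return (Q, D, Sk, Ku) from the 10th, 25th, 50th, 75th and 90th percentiles."""
    q = (p75 - p25) / 2               # quartile deviation, formula [10]
    d = p90 - p10                     # 10-90 percentile range, formula [11]
    sk = p50 - (p10 + p90) / 2        # percentile measure of skewness, formula [17]
    ku = q / d                        # percentile measure of kurtosis, formula [19]
    return q, d, sk, ku


normal = NormalDist(mu=100, sigma=15)     # hypothetical normal distribution
percentiles = [normal.inv_cdf(p) for p in (0.10, 0.25, 0.50, 0.75, 0.90)]
q, d, sk, ku = percentile_constants(*percentiles)

print(round(abs(sk), 6))   # 0.0
print(round(ku, 5))        # 0.26315
```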


Section 21. THE STANDARD DEVIATION


The standard deviation is far more universally significant 
than are any of the preceding. It is based upon the squares 
of the deviations from the mean, instead of upon the first 
powers as is the mean deviation. The exceptional advantages 
of this measure of dispersion will appear in connection with 
subsequent work. The standard deviation is defined as the 
square root of the mean of the squares of the deviations and is 
regularly designated by “σ.” Unless otherwise stipulated
deviations are always from the mean. Using the usual notation:

$$\sigma = \sqrt{\frac{\sum x^2}{N}}$$   (The standard deviation of a distribution) [21]

This is a fundamental formula and should be recognized 
whether written as 




or as, 
The calculation of the standard deviation for the temperature 
data of Table VIII is as follows: 


TABLE XXV
Calculation of σ

Score  Fre-     Dev. from   First    Second    Second moments from mean
       quency   origin 80   moment   moment
  X      f         ξ          fξ       fξ²     f(ξ − δ)² = f(ξ² − 2δξ + δ²)

 65      1       −15        −15       225      1(−15 − δ)² =  1(15² − 2δ[−15] + δ²)
 66      1       −14        −14       196      1(−14 − δ)² =  1(14² − 2δ[−14] + δ²)
 70      1       −10        −10       100      1(−10 − δ)² =  1(10² − 2δ[−10] + δ²)
 71      1        −9         −9        81      1( −9 − δ)² =  1( 9² − 2δ[ −9] + δ²)
 74      2        −6        −12        72      2( −6 − δ)² =  2( 6² − 2δ[ −6] + δ²)
 75      3        −5        −15        75      3( −5 − δ)² =  3( 5² − 2δ[ −5] + δ²)
 76      1        −4         −4        16      1( −4 − δ)² =  1( 4² − 2δ[ −4] + δ²)
 77      1        −3         −3         9      1( −3 − δ)² =  1( 3² − 2δ[ −3] + δ²)
 78      3        −2         −6        12      3( −2 − δ)² =  3( 2² − 2δ[ −2] + δ²)
 79      1        −1         −1         1      1( −1 − δ)² =  1( 1² − 2δ[ −1] + δ²)
                            −89
 80     10         0          0         0     10(  0 − δ)² = 10( 0² − 2δ[  0] + δ²)
 81      8         1          8         8      8(  1 − δ)² =  8( 1² − 2δ[  1] + δ²)
 82      5         2         10        20      5(  2 − δ)² =  5( 2² − 2δ[  2] + δ²)
 83      7         3         21        63      7(  3 − δ)² =  7( 3² − 2δ[  3] + δ²)
 84      2         4          8        32      2(  4 − δ)² =  2( 4² − 2δ[  4] + δ²)
 85      4         5         20       100      4(  5 − δ)² =  4( 5² − 2δ[  5] + δ²)
 86      3         6         18       108      3(  6 − δ)² =  3( 6² − 2δ[  6] + δ²)
 87      1         7          7        49      1(  7 − δ)² =  1( 7² − 2δ[  7] + δ²)
 88      2         8         16       128      2(  8 − δ)² =  2( 8² − 2δ[  8] + δ²)
 90      1        10         10       100      1( 10 − δ)² =  1(10² − 2δ[ 10] + δ²)
 95      1        15         15       225      1( 15 − δ)² =  1(15² − 2δ[ 15] + δ²)
 96      1        16         16       256      1( 16 − δ)² =  1(16² − 2δ[ 16] + δ²)
 98      2        18         36       648      2( 18 − δ)² =  2(18² − 2δ[ 18] + δ²)

Sums    62                  185
                             96      2524      Σξ² − 2δΣξ + Nδ²
                      δ = 1.548    40.710

σ = √(40.710 − (1.548)²) = 6.190




If the arbitrary origin, 80, had been the mean, the standard
deviation would be given by $\sqrt{2524/62}$, but as the arbitrary
origin is an amount δ (= Σξ/N = 96/62 = 1.548) below the
mean, each ξ deviation is algebraically too large by the amount
δ. Accordingly, if, in place of Σξ² we calculate Σ(ξ − δ)² it
will lead to the appropriate sum from which to calculate σ.
Magnitudes (ξ − δ)² are expanded and tabulated in the last
three columns of Table XXV. It is immediately seen from
the table, and is of course also apparent by squaring the binomial,
that Σx² = Σ(ξ − δ)² = Σξ² − Σ2δξ + Σδ². Since
δ is a constant and does not vary from class to class, Σ2δξ =
2δΣξ and similarly Σδ² = Nδ² (here = 62 × 1.548²). The
summation Σξ has already been obtained in summing the first
moments and, from the definition of δ, Σξ = Nδ. Accordingly
Σx² = Σξ² − 2Nδ² + Nδ², and

$$\sigma = \sqrt{\frac{\sum\xi^2}{N} - \delta^2}$$   (The standard deviation of a distribution calculated from an arbitrary origin) [22]


The symbol δ, usually standing for a small magnitude, should
not be so interpreted here, for the formula is rigorously exact 
whether the arbitrary origin differs from the mean by a fraction 
of a unit or a large number of units. 
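
The calculation of Table XXV by formula [22] may be sketched in Python as follows. The frequencies are those we read from the table, the arbitrary origin is 80 as in the text, and the variable names are ours; a direct calculation from the mean is added for comparison.

```python
from math import sqrt

# Frequencies of the maximum-temperature scores, as read from Table XXV.
frequencies = {
    65: 1, 66: 1, 70: 1, 71: 1, 74: 2, 75: 3, 76: 1, 77: 1, 78: 3, 79: 1,
    80: 10, 81: 8, 82: 5, 83: 7, 84: 2, 85: 4, 86: 3, 87: 1, 88: 2,
    90: 1, 95: 1, 96: 1, 98: 2,
}
origin = 80                                         # arbitrary origin

population = sum(frequencies.values())
first_moment_sum = sum(f * (x - origin) for x, f in frequencies.items())
second_moment_sum = sum(f * (x - origin) ** 2 for x, f in frequencies.items())

delta = first_moment_sum / population               # distance of the mean above the origin
sigma = sqrt(second_moment_sum / population - delta ** 2)      # formula [22]

# Direct calculation from the mean, formula [21], for comparison.
mean = origin + delta
sigma_direct = sqrt(
    sum(f * (x - mean) ** 2 for x, f in frequencies.items()) / population
)

print(first_moment_sum, second_moment_sum)   # 96 2524
print(round(sigma, 2))                       # 6.19
```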

The square of the standard deviation, σ², is frequently an
essential constant. It is designated by μ₂, meaning the second
moment about the mean. Without further explanation the
meanings of the various moments, all taken from the mean,
will be understood from the following equations, in which x,
as usual, stands for a deviation from the mean:

(Definition of the moments)

The first moment,   $\mu_1 = \frac{\sum x}{N} = 0$   [23]
The second moment,  $\mu_2 = \sigma^2 = \frac{\sum x^2}{N}$   [23 a]
The third moment,   $\mu_3 = \frac{\sum x^3}{N}$   [23 b]
The fourth moment,  $\mu_4 = \frac{\sum x^4}{N}$   [23 c]
etc.


If deviations from an origin, P, δ distance from the mean, O,
are calculated, then O − P = δ, and x = ξ − δ, and the
following relationships hold:

$$\mu_1 = \frac{\sum(\xi - \delta)}{N} = \frac{\sum\xi}{N} - \delta = 0$$

$$\mu_2 = \frac{\sum(\xi - \delta)^2}{N} = \frac{\sum\xi^2}{N} - \delta^2$$

$$\mu_3 = \frac{\sum(\xi - \delta)^3}{N} = \frac{\sum\xi^3}{N} - 3\delta\frac{\sum\xi^2}{N} + 2\delta^3$$

$$\mu_4 = \frac{\sum(\xi - \delta)^4}{N} = \frac{\sum\xi^4}{N} - 4\delta\frac{\sum\xi^3}{N} + 6\delta^2\frac{\sum\xi^2}{N} - 3\delta^4$$

$$\mu_5 = \text{etc.}$$


If ν₁, ν₂, etc., stand for the moments around the arbitrary origin
the above equations may be more simply written:

(The moments about the mean determined from those about any arbitrary origin)

$\nu_1 = \delta$   [24]
$\mu_1 = \nu_1 - \delta = 0$   [24 a]
$\mu_2 = \nu_2 - \nu_1^2$   [24 b]
$\mu_3 = \nu_3 - 3\nu_2\nu_1 + 2\nu_1^3$   [24 c]
$\mu_4 = \nu_4 - 4\nu_3\nu_1 + 6\nu_2\nu_1^2 - 3\nu_1^4$   [24 d]
etc.

The following formulas give the same results and are usually
the more serviceable,

(Moments about the mean determined from the moments about any arbitrary origin)

$\mu_1 = 0$   [25]
$\mu_2 = \nu_2 - \nu_1^2$   [25 a]
$\mu_3 = \nu_3 - 3\mu_2\nu_1 - \nu_1^3$   [25 b]
$\mu_4 = \nu_4 - 4\mu_3\nu_1 - 6\mu_2\nu_1^2 - \nu_1^4$   [25 c]
etc.

It is sometimes desirable to determine the moments from
some arbitrary origin knowing them from the mean. Solution
of the preceding formulas gives:

$$\nu_n = \mu_n + n\mu_{n-1}\nu_1 + \frac{n(n-1)}{2!}\mu_{n-2}\nu_1^2 + \frac{n(n-1)(n-2)}{3!}\mu_{n-3}\nu_1^3 + \cdots$$   (Moments about an arbitrary origin determined from moments about the mean) [26]

In case the grouping is not fine a small correction to the μ's as
given in formulas [68] is necessary.
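
The passage from moments about an arbitrary origin to moments about the mean may be verified numerically. The Python sketch below is illustrative only; the names `nu1` to `nu4` for the moments about the arbitrary origin are our own labels, the series is the short one used in Section 18, and the origin 10 is an arbitrary choice.

```python
def moment_about(measures, origin, order):
    """Moment of the given order about the given origin."""
    return sum((x - origin) ** order for x in measures) / len(measures)


series = [11, 12, 13, 13, 16]
origin = 10                                   # arbitrary origin (our choice)

nu1, nu2, nu3, nu4 = (moment_about(series, origin, k) for k in (1, 2, 3, 4))

# Moments about the mean in terms of those about the arbitrary origin.
mu2 = nu2 - nu1 ** 2
mu3 = nu3 - 3 * nu2 * nu1 + 2 * nu1 ** 3
mu4 = nu4 - 4 * nu3 * nu1 + 6 * nu2 * nu1 ** 2 - 3 * nu1 ** 4

# The alternative forms, which use the lower moments about the mean.
mu3_alt = nu3 - 3 * mu2 * nu1 - nu1 ** 3
mu4_alt = nu4 - 4 * mu3 * nu1 - 6 * mu2 * nu1 ** 2 - nu1 ** 4

# Direct calculation from the mean, for comparison.
mean = origin + nu1
direct = [moment_about(series, mean, k) for k in (2, 3, 4)]

print([round(m, 4) for m in (mu2, mu3, mu4)])   # [2.8, 3.6, 19.6]
```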




We may now investigate some of the properties of the 
standard deviation. Let us compare the magnitudes of two 
standard deviations; (a) taken from the mean, O, and (b) 
from a point, P, δ distance from the mean. O − P = δ,
and x = ξ − δ. Let σ = the standard deviation from O and
s = the standard deviation from P:

$$\sigma^2 = \frac{\sum x^2}{N}$$

$$s^2 = \frac{\sum\xi^2}{N} = \frac{\sum(x + \delta)^2}{N} = \frac{\sum x^2}{N} + \frac{2\delta\sum x}{N} + \frac{\sum\delta^2}{N}, \text{ and since } \sum x = 0,$$

Hence

$$s^2 = \sigma^2 + \delta^2$$   [27]

or

$$s = \sqrt{\sigma^2 + \delta^2}$$   (Standard deviation about an arbitrary origin determined from the standard deviation about the mean)

Since δ, whether positive or negative, enters into this expression
as a square, s² > σ²; in other words, the standard deviation
is a minimum when taken from the mean. This is a very
important property of the mean. 
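
Formula [27] may be tried on the short series of Section 18. The Python sketch below is offered as an illustration; the function name and the particular origins tested are our own choices.

```python
def second_moment_about(measures, origin):
    """Mean of the squared deviations of the measures from the given origin."""
    return sum((x - origin) ** 2 for x in measures) / len(measures)


series = [11, 12, 13, 13, 16]
mean = sum(series) / len(series)
sigma_squared = second_moment_about(series, mean)          # 2.8

for origin in (10, 12, 13, 15.5, 20):                      # arbitrary origins
    delta = mean - origin                                  # O - P
    s_squared = second_moment_about(series, origin)
    # Formula [27]: s squared equals sigma squared plus delta squared.
    assert abs(s_squared - (sigma_squared + delta ** 2)) < 1e-9
    assert s_squared >= sigma_squared

s_squared_from_10 = second_moment_about(series, 10)
print(round(sigma_squared, 4), round(s_squared_from_10, 4))   # 2.8 11.8
```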

Formula [24] for μ₂ gives the standard deviation squared in
terms of the moments about an arbitrary origin. Formula [27] 
for s? gives the standard deviation squared from an arbitrary 
origin in terms of the second moment around the mean and 
the distance between the mean and arbitrary origin. It should, 
however, be noted that neither of these formulas gives the 
standard deviation around a second arbitrary origin in terms 
of the moments around a first arbitrary origin. This problem 
may readily be solved; if P and Q are the second and first 
origins and if ξ and ζ are deviations and s and S standard
deviations around these origins respectively, we have:

$$P - Q = \Delta$$

$$\xi = \zeta - \Delta$$

$$s^2 = \frac{\sum\xi^2}{N} = \frac{\sum(\zeta - \Delta)^2}{N} = \frac{\sum\zeta^2 - 2\Delta\sum\zeta + N\Delta^2}{N} = S^2 + \Delta^2 - 2\Delta\frac{\sum\zeta}{N}$$   (Relation between standard deviations about two arbitrary origins) [28]


Expressed in words: if moments around any two origins are 
taken, the second moment around the second origin equals 
the second moment around the first origin plus the square of 




the difference between the origins minus twice the product of 
the difference (taking the second origin minus the first) and 
the first moment around the first origin. 

The formula as written is to be used in determining the 
second moment around the “second” origin when the moments
around the “first” origin are known.


Section 22. THE STANDARD ERROR OF THE MEAN 


If it is desired to determine the reliability of the mean it is 
necessary to have an estimate of how a number of equally 
excellent, i.e., similarly derived, means distribute themselves — 
that is, a new distribution is to be conceived with the means 
themselves as the gross scores. The standard deviation of 
these means is indicative of the precision of any one of them. 
If this distribution of means has a very small spread, or standard 
deviation, then any one of them is a good measure, good in 
the sense that it is a close approximation to the mean of all 
the means. We thus need σ_M, the standard deviation of the
means. If there are M sets of N measures each, and if the
mean of the MN (where MN equals a very large number)
measures, i.e., the mean of the means, is the true value, or
true origin, then x stands for a deviation of a measure from
this origin and $\frac{x_1 + x_2 + \cdots + x_N}{N}$, the mean of one set of N
measures, is expressed as a deviation from this same origin.
The standard deviation of such means is σ_M, the standard
deviation sought. The standard deviation of the distribu- 
tion of measures from the mean of the N measures will not be 
identical with the standard deviation of the same measures 
from the origin as here defined, but the difference may be ex- 
pected to be negligibly small if N is larger than 25, which we 
shall assume to be the case in this derivation. We will desig- 
nate the standard deviation of the original measures by o. 
We have: 


$$\sigma_M^2 = \frac{\sum\Bigl(\frac{x_1 + x_2 + \cdots + x_N}{N}\Bigr)^2}{M}$$

$$MN\sigma_M^2 = \sum\Bigl(\frac{x_1^2 + x_2^2 + \cdots + x_N^2 + 2x_1x_2 + 2x_1x_3 + \cdots + 2x_{N-1}x_N}{N}\Bigr)$$




However, $\frac{x_1^2 + x_2^2 + \cdots + x_N^2}{N} = \sigma^2$, and as Σ designates a
summation of M such magnitudes, $\sum\frac{x_1^2 + x_2^2 + \cdots + x_N^2}{N} = M\sigma^2$.
Also $2x_1x_2 + 2x_1x_3 + \cdots$ may be rewritten,
$(x_1x_2 + x_1x_3 + \cdots + x_1x_N + x_2x_1 + x_2x_3 + x_2x_4 + \cdots + x_2x_N + \cdots)$
which, if S₁, S₂, ··· stand for summations of N − 1
terms each, is = $x_1S_1 + x_2S_2 + \cdots + x_NS_N$. Each of these
S summations is closely equal to zero. [Product theorem,
see Section 23.] Since these summations are at times small
positive and at other times small negative magnitudes and
since x₁, x₂, ··· are likewise both positive and negative and are
entirely independent of the S's, it is clear that the whole expression,
$(x_1S_1 + x_2S_2 + \cdots + x_NS_N)$, does not vary from zero
by more than a small amount and is negligible in comparison with
the sum of the square terms. The equation may then be
written:

$$MN\sigma_M^2 = M\sigma^2, \text{ or}$$

$$\sigma_M = \frac{\sigma}{\sqrt{N}}$$   (Standard error of the mean) [29]

This is a fundamental relation applicable when N > 25.
Expressed in words: The standard deviation of the mean 
equals that of the gross scores divided by the square root of 
the population. 
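
The relation may be illustrated by actually drawing many sets of measures and observing how their means scatter. The Python sketch below is a simulation under assumptions of our own: a normal population with mean 50 and standard deviation 10, sets of N = 36 measures, 4000 such sets, and a fixed seed so that the run is repeatable.

```python
import random
from math import sqrt
from statistics import mean, pstdev

random.seed(0)                       # fixed seed, so the run is repeatable

sigma = 10.0                         # standard deviation of the gross scores (assumed)
cases_per_set = 36                   # N, the population of each set
number_of_sets = 4000                # M, the number of sets drawn

set_means = [
    mean([random.gauss(50.0, sigma) for _ in range(cases_per_set)])
    for _ in range(number_of_sets)
]

observed = pstdev(set_means)                 # standard deviation of the means
predicted = sigma / sqrt(cases_per_set)      # formula [29]

print(round(predicted, 4))                   # 1.6667
print(round(observed, 4))                    # close to the predicted value
```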

Any measure whatsoever may be thought of as one of a 
distribution, the variability of the distribution being an indi- 
cation of the error involved when any single measure of the 
distribution, taken at random, is chosen as the value of the 
thing measured. Thus when a measure is taken as the best 
obtainable value the standard deviation of just such measures 
as the one taken is the standard error. Thus the “standard
error” of a measure and the “standard deviation” of such
measures are synonymous expressions. The relation between 
the standard error and the probable error as derived in Sec- 
tion 28 is 


Probable error = .6744898 standard error [Formula 33 of Sec. 27]. 




Section 23. THE STANDARD ERROR OF ANY MOMENT


The product theorem used in the preceding derivation may 
be stated: 
The sum of products of measures which are 
independent of each other and whose 
means are zero, equals zero. [Product theorem] 
This theorem, only roughly proven above, will later, in con- 
nection with the subject of correlation, be seen to be a necessary 
consequence of independence between measures. By utilizing 
it we may determine the standard deviation of any moment,
μ_n, in a manner very similar to that in which we have determined
the standard deviation of the first moment, μ₁, the mean.
Consider a population composed of M sets of N measures
each. The n'th moment of the total population is, if Σ indicates
a summation of M terms and S a summation of N terms:

$$\mu_n = \frac{\sum(Sx^n)}{MN}$$

The deviation from this value of a determination based upon
one set of N measures is:

$$\frac{Sx^n}{N} - \frac{\sum(Sx^n)}{MN} = \frac{Sx^n}{N} - \mu_n = \frac{S(x^n - \mu_n)}{N}$$

This is a small magnitude. The sum of M such would of course
be zero, but the sum of the squares would not, as there would
then be no negative terms. Accordingly the standard deviation
desired is:

$$\sigma_{\mu_n} = \sqrt{\frac{\sum\Bigl[\frac{S(x^n - \mu_n)}{N}\Bigr]^2}{M}}$$

Let $S(x^n - \mu_n) = (x_1^n - \mu_n) + (x_2^n - \mu_n) + \cdots + (x_N^n - \mu_n) = \delta_1 + \delta_2 + \cdots + \delta_N$,
let us say. Then $MN\sigma^2_{\mu_n} = \frac{1}{N}\sum[S\delta]^2$ in which $[S\delta]^2 = S\delta^2 + 2S'\delta_p\delta_q$,
where S′ = a summation of $\frac{N(N-1)}{2}$ terms which
approaches zero according to the theorem just stated. Accordingly,

$$MN\sigma^2_{\mu_n} = \frac{1}{N}\sum S\delta^2 = \frac{1}{N}\sum{}'\delta^2$$

in which Σ′ indicates a summation of MN terms.




Replacing the δ's by the equivalent binomials, we have:

$$MN\sigma^2_{\mu_n} = \frac{1}{N}\sum{}'(x^{2n} - 2\mu_nx^n + \mu_n^2),$$ which, since $2\mu_n\sum{}'x^n = 2MN\mu_n^2$,

$$= \frac{1}{N}(MN\mu_{2n} - MN\mu_n^2)$$

$$\sigma_{\mu_n} = \sqrt{\frac{\mu_{2n} - \mu_n^2}{N}}$$   (Standard error of any moment) [30]


It is thus seen that the standard error of any moment is de- 
termined when that moment, the moment twice as large, and 
the population are known. It is to be noted that this formula 
is entirely general and does not depend upon having a sym- 
metrical distribution. It only requires that the populations 
dealt with shall not be small. 

Applying this formula to the determination of the standard 
deviation of the mean, n = 1, and we have:

$$\sigma_M = \sigma_{\mu_1} = \sqrt{\frac{\mu_2 - \mu_1^2}{N}}$$   (Standard error of the mean) [29 a]

This is the general formula. It may be written more simply
for it has already been pointed out that μ₁ = 0, and μ₂ = σ²,
so that the equation becomes:

$$\sigma_M = \frac{\sigma}{\sqrt{N}}$$   (Standard error of the mean) [29]

This, of course, is identical with that previously derived.

We may determine the standard error of the standard
deviation, but shall first need that of the standard deviation
squared, μ₂: By formula [30] we have

$$\sigma_{\mu_2} = \sqrt{\frac{\mu_4 - \mu_2^2}{N}}$$   (Standard error of the second moment) [31]


It remains to determine what is the square root of a quantity 
corresponding to a given deviation in the quantity itself. 
Consider the magnitudes μ₂ and (μ₂ + Δ) and also √μ₂ and
√(μ₂ + Δ) or their equals σ² and (σ² + Δ) and also σ and
$\Bigl(\sigma + \frac{\Delta}{2\sigma} - \frac{\Delta^2}{8\sigma^3} + \cdots\Bigr)$. (This latter after expansion of the
radical by the binomial theorem.)




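It is seen that corresponding to a small error $\Delta$ in $\sigma^2$, there is
an error

$$\left(\frac{\Delta}{2\sigma} - \frac{\Delta^2}{8\sigma^3} + \cdots\right)$$

in $\sigma$. However, in all ordinary situations, $\Delta^2/8\sigma^3$ and higher
terms are negligible in comparison with $\Delta/2\sigma$, so that we
have:

$$\sigma_\sigma = \frac{\sigma_{\mu_2}}{2\sigma} = \frac{1}{2\sigma}\sqrt{\frac{\mu_4 - \mu_2^2}{N}} \qquad \text{(Standard error of the standard deviation)} \ldots [32]$$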
Utilizing formula [51] of Section 28 we have

$$\sigma_\sigma = \frac{\sigma}{\sqrt{2N}} \qquad \text{(Standard error of the standard deviation in a normal distribution)} \ldots [32\,a]$$
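As a numerical illustration of formulas [30], [32] and [32 a], the following Python sketch estimates the standard error of a moment from a sample. The function names, the seed, and the simulated normal sample are assumptions made for the illustration only; they do not come from the text.

```python
import math
import random


def moment(xs, order, origin=0.0):
    """Mean of (x - origin)**order over the sample."""
    return sum((x - origin) ** order for x in xs) / len(xs)


def moment_standard_error(xs, order, origin=0.0):
    """Formula [30]: sqrt((mu_2n - mu_n**2) / N), moments taken about `origin`."""
    mu_n = moment(xs, order, origin)
    mu_2n = moment(xs, 2 * order, origin)
    return math.sqrt((mu_2n - mu_n ** 2) / len(xs))


def sd_standard_error(xs):
    """Formula [32]: standard error of mu_2 divided by 2 sigma (moments about the mean)."""
    mean = sum(xs) / len(xs)
    sigma = math.sqrt(moment(xs, 2, mean))
    return moment_standard_error(xs, 2, mean) / (2.0 * sigma)


random.seed(1)  # hypothetical simulated data: 20000 normal measures
sample = [random.gauss(50.0, 10.0) for _ in range(20000)]
mean = sum(sample) / len(sample)
sigma = math.sqrt(moment(sample, 2, mean))

se_mean = moment_standard_error(sample, 1, mean)        # reduces to sigma / sqrt(N)
se_sd_general = sd_standard_error(sample)               # formula [32]
se_sd_normal = sigma / math.sqrt(2 * len(sample))       # formula [32 a]
```

For a sample that is in fact close to normal, the general value from [32] and the normal-theory value from [32 a] should agree closely; for a markedly non-normal sample they need not.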


Section 24. THE STANDARD ERROR OF A CLASS FREQUENCY;
OF THE MEDIAN; AND OF A PERCENTILE


The deviation in the value of the median is a function of
the deviation in the frequencies below, or above it. Consider
the accompanying graph to represent the distribution of certain
scores in the case of a very large population. If $\Delta$ frequencies
are transferred from below Mdn, the median point,
to above it, the median would be shifted up. The amount of
this shifting may be readily determined.


[Figure: frequency curve of the scores with the median point, Mdn, marked.]


Let $f$ = the frequency in a small interval of range, $i$, near the
center of which is the median.

Then the new median has been shifted an amount $i(\Delta/f)$
above the old median, assuming that the frequencies in the
interval $i$ distribute themselves in a rectangular manner. The
fact that this assumption is not the most reasonable which can
ordinarily be made has entirely insignificant influence in case
distributions do not show very exceptional rates of change in
the vicinity of the median and in case populations are not small,
let us say not less than 25.

It is thus seen that corresponding to a change $\Delta$ in the number
of frequencies below the median, there is a definitely established
change in the median. The standard error of the median may
therefore be written,

$$\sigma_{Mdn} = \frac{i}{f}\,\sigma_\Delta \ldots [33]$$

It only remains to calculate the standard deviation of the
$\Delta$'s and substitute in the above expression in place of $\sigma_\Delta$, to
have the standard error of the median.

In drawing a sample of $n$ measures from the total population,
in which the chance of each measure lying below the median
is one half, we will call those which lie below the median
successes and those above failures and we will let F equal the
number of successes. If two scores are drawn ($n = 2$) then
the chance of both being successes; of the first being a success
and the second a failure; of the first a failure and the second a
success or of both being failures is $[(1/2)(1/2)]$ in each instance.
Each of these is equally likely to occur, so that if a large number,
N, of such samplings of two are made we have the following
distribution of successes, or of frequencies lying below the
median:

SUCCESSES    FREQUENCIES
0            N × 1/2 × 1/2 = N/4
1            N × 1/2 × 1/2 × 2 = N/2
2            N × 1/2 × 1/2 = N/4

That is, one fourth of the samplings will show no measures in 
this category (below the median), one half will show one 
measure in it, and one fourth will show two measures in it. 

If three scores are drawn at a time there is just one permu- 
tation yielding three successes, three permutations yielding 
two successes and one failure, three yielding one success and 
two failures, and one yielding three failures, so that we have 
the following distribution: 


SUCCESSES    FREQUENCIES
0            N × 1/2 × 1/2 × 1/2 = N/8
1            N × 1/2 × 1/2 × 1/2 × 3 = 3N/8
2            N × 1/2 × 1/2 × 1/2 × 3 = 3N/8
3            N × 1/2 × 1/2 × 1/2 = N/8




That is, 1/8 of the samplings will show zero successes, 3/8 one 
success, 3/8 two successes, and 1/8 three successes. 

If four are drawn ($n = 4$) the frequencies will run N(1/16),
N(4/16), N(6/16), N(4/16), N(1/16), and in general, if $n$ are
drawn at a time the frequencies will be given by the coefficients
of the successive terms of the binomial $N(.5 + .5)^n$. Dropping
N, which is a constant throughout, the general distribution
may then be written:


SUCCESSES IN DRAWINGS
OF n AT A TIME           FREQUENCIES
0                        $1\,(1/2)^n$
1                        $n\,(1/2)^n$
2                        $\frac{n(n-1)}{1\cdot 2}\,(1/2)^n$
3                        $\frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\,(1/2)^n$
etc.                     etc.


Starting with this distribution we could readily determine its
mean and standard deviation, but as it is just a special case
of the more general problem in which the chance of success
for any single drawing is $p$ ($p$ not necessarily 1/2) this latter
will be attacked.

Let $p$ = the chance of success and $q$ that of failure. Then

$$p + q = 1 \ldots [34]$$


Following the same argument as for $p = q = .5$, the distribution
of successes when $n$ at a time are drawn becomes:

SUCCESSES IN
n DRAWINGS               FREQUENCIES
0                        $1\,q^n$
1                        $n\,q^{n-1}p$
2                        $\frac{n(n-1)}{1\cdot 2}\,q^{n-2}p^2$
3                        $\frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\,q^{n-3}p^3$
etc.                     etc.


We will now proceed to calculate the standard deviation of 
these numbers of successes by calculating the second moment 
from the point ‘‘zero successes,’’ and then transferring to the 
mean by the aid of formula [22]. 




SUCCESSES IN DRAWINGS
OF n AT A TIME           FREQUENCIES
X        f                                                        fX
0        $q^n$                                                    0
1        $npq^{n-1}$                                              $npq^{n-1}$
2        $\frac{n(n-1)}{1\cdot 2}\,p^2q^{n-2}$                    $np\,(n-1)\,pq^{n-2}$
3        $\frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\,p^3q^{n-3}$        $np\,\frac{(n-1)(n-2)}{1\cdot 2}\,p^2q^{n-3}$
4        $\frac{n(n-1)(n-2)(n-3)}{1\cdot 2\cdot 3\cdot 4}\,p^4q^{n-4}$    $np\,\frac{(n-1)(n-2)(n-3)}{1\cdot 2\cdot 3}\,p^3q^{n-4}$
etc.     etc.                                                     etc.

$\Sigma f = (p + q)^n = 1$          $\Sigma fX = np\,(p + q)^{n-1} = np$

Therefore $\nu_1 = np \ldots [35]$

X        fX²
0        0
1        $npq^{n-1}$
2        $np\,(n-1)\,pq^{n-2} + np\,(n-1)\,pq^{n-2}$
3        $np\,\frac{(n-1)(n-2)}{1\cdot 2}\,p^2q^{n-3} + np\,(n-1)(n-2)\,p^2q^{n-3}$
4        $np\,\frac{(n-1)(n-2)(n-3)}{1\cdot 2\cdot 3}\,p^3q^{n-4} + np\,\frac{(n-1)(n-2)(n-3)}{1\cdot 2}\,p^3q^{n-4}$
etc.     etc.

$\Sigma fX^2 = np\,(p + q)^{n-1} + np^2\,(n-1)\,(p + q)^{n-2} = np + n^2p^2 - np^2 = \nu_2$

Therefore $\mu_2 = npq$, and $\sigma = \sqrt{npq} \ldots [36]$


The third and fourth moments, derived by the same process,
are:

$$\mu_3 = npq\,(q - p) \ldots [37]$$

$$\mu_4 = npq\,[1 + 3(n - 2)\,pq] \ldots [38]$$

They are recorded here for future reference, but are not used
in the immediate problem, the calculation of the standard
error of the median.
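The results [35] to [38] can be checked numerically by building the binomial frequencies directly and taking moments about the mean. The sketch below is only an illustration; the function names and the particular values n = 12, p = .3 are assumptions chosen for the example.

```python
from math import comb


def binomial_frequencies(n, p):
    """Relative frequency of X = 0, 1, ..., n successes: terms of (q + p)**n."""
    q = 1.0 - p
    return [comb(n, x) * p ** x * q ** (n - x) for x in range(n + 1)]


def mean_and_central_moments(n, p):
    """Return the mean and the 2nd, 3rd and 4th moments about the mean."""
    freqs = binomial_frequencies(n, p)
    mean = sum(f * x for x, f in enumerate(freqs))
    central = [sum(f * (x - mean) ** k for x, f in enumerate(freqs)) for k in (2, 3, 4)]
    return mean, central


n, p = 12, 0.3          # hypothetical illustration values
q = 1.0 - p
mean, (mu2, mu3, mu4) = mean_and_central_moments(n, p)

expected_mean = n * p                                   # [35]
expected_mu2 = n * p * q                                # [36], sigma squared
expected_mu3 = n * p * q * (q - p)                      # [37]
expected_mu4 = n * p * q * (1 + 3 * (n - 2) * p * q)    # [38]
```

Changing n and p and rerunning is a quick way to confirm that the closed forms hold for any binomial, not only the symmetrical case p = q = .5.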

The magnitude $\mu_2$ is the standard deviation squared of the
sum of the frequencies in a category for which the chance of
each of the separate measures being in the category is $p$. Thus
if N (instead of n as above) equals the size of the sample drawn,
F the frequency in a certain category, $p$ the likelihood of a
measure lying within it, and $q$ the likelihood of the
measure lying without it, then

$$\sigma_F = \sqrt{Npq} \qquad \text{(The standard deviation of the frequency in a given category)} \ldots [39]$$




If the proportion in a category instead of the gross frequency is
considered we have

$$\sigma_p = \frac{1}{N}\,\sigma_F,$$

so that finally

$$\sigma_p = \sqrt{\frac{pq}{N}} \qquad \text{(The standard deviation of a proportion)} \ldots [40]$$


This is the basic formula underlying the theory of contingency, 
i.e., the statistics of categories. 
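A small simulation makes formulas [39] and [40] concrete. The sketch below draws many samples of N measures, counts how many fall in a category of chance p, and compares the observed scatter of the counts with $\sqrt{Npq}$. The sample size, the chance p, the number of trials and the seed are all assumptions chosen for the illustration.

```python
import math
import random


def sd_frequency(n, p):
    """Formula [39]: standard deviation of the frequency in a category."""
    return math.sqrt(n * p * (1.0 - p))


def sd_proportion(n, p):
    """Formula [40]: standard deviation of the proportion in a category."""
    return math.sqrt(p * (1.0 - p) / n)


random.seed(3)
n, p, trials = 200, 0.3, 4000   # hypothetical illustration values

# Each trial: the number of the n measures that fall in the category.
counts = [sum(random.random() < p for _ in range(n)) for _ in range(trials)]
mean_count = sum(counts) / trials
observed_sd = math.sqrt(sum((c - mean_count) ** 2 for c in counts) / trials)

theoretical_sd = sd_frequency(n, p)
```

The observed standard deviation of the counts should fall within a few per cent of the theoretical value, and dividing either by N gives the corresponding figure for the proportion.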

We may use this general result in determining the standard
deviation of the frequencies below the median. In this case
$p = q = 1/2$, so that

$$\sigma_F = \frac{\sqrt{N}}{2}$$

This is the standard deviation of the $\Delta$'s, required to determine
the standard error of the median. Substituting in [33]

$$\sigma_{Mdn} = \frac{i\sqrt{N}}{2f} \qquad \text{(The standard error of the median)} \ldots [41]$$

By parity of reasoning the standard error of any percentile
may be found. Using the same notation as in Section 13, it is

$$\sigma_{P_p} = \frac{i}{f}\sqrt{Npq} \qquad \text{(The standard error of a percentile)} \ldots [42]$$

Formula [42] is ordinarily the one needed, but for certain
problems the existence or assumption of normality permits
the use of the following (Kelley, 1921, new):

$$\sigma_{P_p} = \frac{\sigma\sqrt{pq}}{z\sqrt{N}} \qquad \text{(The standard error of a percentile of a normal distribution)} \ldots [43]$$

in which $\sigma$ is the standard deviation of the distribution and $z$
the ordinate corresponding to $q$ as given in Table K-W.
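The two percentile formulas can be compared numerically. The sketch below reads [42] as $(i/f)\sqrt{Npq}$ and [43] as $\sigma\sqrt{pq}/(z\sqrt{N})$, with z the ordinate of the unit normal curve at the percentile; that reading of [43], the function names, and the illustrative numbers are assumptions. Python's `statistics.NormalDist` stands in for Table K-W.

```python
import math
from statistics import NormalDist


def se_percentile(i, f, n, p):
    """Formula [42]: (i / f) * sqrt(N p q), using the class frequency f in interval i."""
    return (i / f) * math.sqrt(n * p * (1.0 - p))


def se_percentile_normal(sigma, n, p):
    """Formula [43] as read here: sigma * sqrt(p q) / (z * sqrt(N))."""
    unit = NormalDist()                    # unit normal curve
    z = unit.pdf(unit.inv_cdf(p))          # ordinate at the percentile point
    return sigma * math.sqrt(p * (1.0 - p)) / (z * math.sqrt(n))


# Hypothetical illustration: lower quartile of 400 measures, sigma = 10, unit interval.
sigma, n, p, i = 10.0, 400, 0.25, 1.0
z = NormalDist().pdf(NormalDist().inv_cdf(p))
f = i * n * z / sigma      # class frequency an exactly normal curve would give there

from_frequency = se_percentile(i, f, n, p)
from_normal = se_percentile_normal(sigma, n, p)
median_case = se_percentile_normal(1.0, 100, 0.5)   # about 1.2533 / sqrt(N)
```

When the class frequency is exactly what a normal curve would give, the two formulas coincide; in practice [42] uses the observed (smoothed) frequency and so does not depend on normality.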

A precaution is necessary in using formulas [41] and [42] in
that, theoretically, $f$ is the frequency in the interval $i$ in the
case of a very large population. A single class frequency for
ordinary finite populations is a quite unstable magnitude, so
that in determining the class frequency for $f$ it is well to smooth
the curve in the neighborhood of the percentile by averaging
the three or five class frequencies nearest to it. The exact
number to be averaged depends upon local periodicity and the
total population, but as a general rule for populations less than
200 it is advisable to average such a number as extend over
approximately 1/8 of the total range. For larger populations
a smaller number of intervals may be averaged. It is obvious
that the same result is accomplished if the frequencies in a
small number of neighboring intervals are added to give the $f$,
and the total range covered by these intervals taken as the $i$,
used in the formulas.

The standard errors of the two most important averages 
have been determined. That for the mode, except when cal- 
culated by determining the equation of the curve which fits 
the data, is known to be very high. No simple formula for
its determination is available.

In order to compare the reliabilities of different averages we
will calculate the standard errors of the mean and of the
median for the temperature data of Table VII.

$$M = 81.55;\quad Mdn = 81.25;\quad \sigma = 6.19;\quad N = 62$$

$$\sigma_M = \frac{\sigma}{\sqrt{N}} = \frac{6.19}{\sqrt{62}} = .786$$

To compare with this, the standard error of the median will be
calculated, using five different intervals in the neighborhood of
the median.

(a) $i = 1$. $f$ of interval, 80.5°-81.5°, = 8. $\sigma_{Mdn} = \frac{\sqrt{62}}{2} \times \frac{1}{8} = .493$
(b) $i = 2$. $f$ of interval, 80.5°-82.5°, = 13. $\sigma_{Mdn} = \frac{\sqrt{62}}{2} \times \frac{2}{13} = .606$
(c) $i = 3$. $f$ of interval, 79.5°-82.5°, = 23. $\sigma_{Mdn} = .514$
(d) $i = 4$. $f$ of interval, 79.5°-83.5°, = 30. $\sigma_{Mdn} = .525$
(e) $i = 5$. $f$ of interval, 78.5°-83.5°, = 31. $\sigma_{Mdn} = .636$
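The arithmetic of this comparison is easily reproduced. The sketch below applies $\sigma/\sqrt{N}$ for the mean and formula [41], $i\sqrt{N}/2f$, for the median to the figures quoted for the temperature data; the function and variable names are assumptions made for the illustration.

```python
import math

N = 62          # population of the temperature data
SIGMA = 6.19    # standard deviation of the temperature data


def se_mean(sigma, n):
    """Standard error of the mean, formula [29]."""
    return sigma / math.sqrt(n)


def se_median(i, f, n):
    """Standard error of the median, formula [41]: i * sqrt(N) / (2 f)."""
    return i * math.sqrt(n) / (2.0 * f)


# (interval width i, frequency f in that interval) for cases (a) to (e)
intervals = [(1, 8), (2, 13), (3, 23), (4, 30), (5, 31)]

se_m = se_mean(SIGMA, N)
se_mdn = [se_median(i, f, N) for i, f in intervals]

for label, value in zip("abcde", se_mdn):
    print(f"({label}) standard error of median = {value:.3f}")
print(f"standard error of mean = {se_m:.3f}")
```

Values computed this way may differ from the printed figures by a unit in the third decimal place, which is immaterial to the comparison: every one of the five median values is smaller than the standard error of the mean.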


It is well-nigh impossible to say which of these five values is 
the most reliable, but since the population is only 62, the last 
value, .636, based upon an interval which is 1/7 of the range is 
rather to be preferred to any of the others. Accepting it as 
the best value it is seen that the median has a smaller standard 
error than the mean. This means that, if this sample of 62
is truly representative of the distribution of temperatures, the
median of the distribution can be determined with greater
accuracy than can the mean, and that accordingly the median
is preferable in this instance to the mean, as a measure of
central tendency. Other considerations may enter in, such
as, for example, the desirability of combining different sets of
data, calculating correlations, etc., in which case the mean
should always be used, as it permits of such statistical treatment
whereas the median does not; but if such considerations are
not present the proper average to use is the one which is the most
reliable. It is thus seen that the all too customary choice of
an average "because of the nature of the distribution" should
give way to a choice based upon rigorous statistical considerations
as to reliability. Having decided upon an average the
appropriate measure of dispersion follows as a consequence — 
the quartile deviations or preferably D, the 10-90 percentile 
range, should be used with the median, and the standard 
deviation with the mean. The standard deviation is much 
the more reliable of these two measures of dispersion for all 
ordinary uni-modal distributions, even though they be very 
appreciably skew. Therefore, if, for a certain investigation, 
the measure of dispersion is a more important measure than 
that of central tendency, no error would ordinarily be made if 
the mean and standard deviation are chosen, no matter what 
the reliability of the median may be. 

The reader will have noted that measures of reliability are
simply measures of dispersion. Any measure not infallibly 
determined may be thought of as one of a population of such 
measures. It then only remains to calculate a measure of 
dispersion for this population to secure an index of the relia- 
bility of the measure. The measure of dispersion most uni- 
versally available and most reliable is the standard deviation. 
The range though frequently available, is very unreliable and 
should be used for rough or hasty determinations only. The 
relationship of the five measures of dispersion — standard 
deviation, mean deviation, 10-90 percentile range, quartile 
deviation, and the range, to each other will be considered in 
Section 31 and Problem 1, Chapter V, for the normal distri- 
bution, which is probably more typical of uni-modal distribu- 
tions in general than any other single distribution. 




PROBLEMS 


1. Calculate the first and second moments from "zero income" for the
data of Table X and by proper transformation (a) determine $\mu_2$, the second
moment from the mean, and (b) determine the second moment from the
median by formula [28] and check by formula [27].


2. Calculate the standard errors of the (a) L. Q., (b) Mdn., (c) U. Q.,
(d) M, for the hypothetical distribution of incomes, Table X. Which is
the more accurate average for these data, the mean or the median?


3. Using the grouped data giving changes in wholesale prices, Table XV, 
determine which is the more reliable average, the mean or the median. 


4. (a) Which is the more reliable average, the mean or the median, in 
the case of College Marks, Table XVIII? 
(b) In this case what is the proper number of class intervals to combine
in determining the standard error of the median? [Answer to (b):
The population, 773, is large and an interval of three units, 7g, the range, 
would be reasonably satisfactory were it not for the fact that there is a 
decided periodicity, which is irrelevant so far as pupils’ talents are con- 
cerned, so that the proper interval is one of five units.] 


5. (a) Determine the standard error of the second moment of the in- 
come data, Table X. 
(b) Determine the standard error of the standard deviation of the 
same data. 


6. Derive $\mu_3$ and $\mu_4$ for frequencies given by the terms of the binomial
$(p + q)^n$ in a manner similar to that illustrated for $\mu_1$ and $\mu_2$. Much
scratch paper will be needed.


7. Prove that if c is a constant and x a variable then
$$\sigma_{cx} = c\,\sigma_x$$
8. Devise a formula similar to [7 a] except that the sum of the measures 
above the mean instead of the sum of those below is involved. 


CHAPTER V 
THE NORMAL PROBABILITY DISTRIBUTION 


Section 25. DERIVATION OF EQUATION OF NORMAL 
DISTRIBUTION 


Many frequency distributions are very similar in type. 
These distributions are characterized by being symmetrical 
with respect to the mean; by having a single mode which is 
at the mean: i.e., the slope of the curve at the mean is zero; 
by tapering off from the mean and in such a manner that the 
slope again approaches zero as the frequencies or ordinates 
of the curve approach zero. The symbol y will be used for 
the ordinate unless N = 1.0, in which case z is used to conform 
with certain tables in this text and with Sheppard’s tables. 
Following Pearson, we may derive the simplest curve which 
has these characteristics. It is necessary to use the calculus 
in this derivation, so that one unfamiliar with it may simply 
note the conclusions. 

The differential equation dy/dx = Cxy is an equation, origin 
at the mean, whose slope is zero both when x is zero and when 
y is zero. It is the most concise form imposing the required 
slope conditions of any which has been noted by the writer or 
any which he is able to conceive. Integrating this equation 
gives: (All the integration formulas used in this chapter may 
be found in Peirce, 1910.) 

$$y = k\,e^{\frac{Cx^2}{2}}$$

If k and C are both positive it is found, by plotting or by
more analytical means, that the curve has a minimum instead
of a maximum at $x = 0$; also that y does not approach zero for
any real value of x. It is therefore necessary that C be negative,
or setting $C = -c$ the differential equation may be written
$dy/dx = -cxy$ and the integral

$$y = k\,e^{-\frac{cx^2}{2}}$$




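Let us investigate the moments of this curve. If N is the
total population or total area under the curve

$$N\mu_0 = \int_{-\infty}^{\infty} y\,dx = k\sqrt{\frac{2\pi}{c}} = N$$

$$N\mu_1 = \int_{-\infty}^{\infty} yx\,dx = NM = 0$$

$$N\mu_2 = \int_{-\infty}^{\infty} yx^2\,dx = N\sigma^2 = \frac{k}{c}\sqrt{\frac{2\pi}{c}} = \frac{N}{c}$$

Solving the first and third of these equations for c and k gives
$c = 1/\sigma^2$ and $k = N/\sigma\sqrt{2\pi}$.
This gives as the final equation of the curve

$$y = \frac{N}{\sigma\sqrt{2\pi}}\,e^{\frac{-x^2}{2\sigma^2}} \qquad \text{(The Normal Probability Curve)} \ldots [44]$$

in which y is the frequency or ordinate corresponding to a
deviation x, N is the total frequency, $\sigma$ the standard deviation
of the measures, $\pi = 3.1416$, and $e = 2.7183$, the Naperian
base of logarithms. This equation is identical with the following
convergent series:

$$y = \frac{N}{\sigma\sqrt{2\pi}}\left[1 - \left(\frac{x^2}{2\sigma^2}\right) + \frac{1}{2!}\left(\frac{x^2}{2\sigma^2}\right)^2 - \frac{1}{3!}\left(\frac{x^2}{2\sigma^2}\right)^3 + \cdots\right]$$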
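The constants of the normal curve can be confirmed by direct numerical integration: the area under the curve should return N, and the second moment about the mean should return $\sigma^2$. The following sketch uses a plain trapezoidal rule; the chosen N, the chosen $\sigma$, the integration limits and the function names are assumptions for the illustration.

```python
import math


def normal_ordinate(x, n_total, sigma):
    """Formula [44]: y = N / (sigma * sqrt(2 pi)) * exp(-x^2 / (2 sigma^2))."""
    return n_total / (sigma * math.sqrt(2.0 * math.pi)) * math.exp(-x * x / (2.0 * sigma ** 2))


def integrate(func, lo, hi, steps=20000):
    """Trapezoidal rule over [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.5 * (func(lo) + func(hi))
    for k in range(1, steps):
        total += func(lo + k * h)
    return total * h


N_TOTAL, SIGMA = 500.0, 2.0        # hypothetical illustration values
LIMIT = 12.0 * SIGMA               # far enough out that the tails are negligible

area = integrate(lambda x: normal_ordinate(x, N_TOTAL, SIGMA), -LIMIT, LIMIT)
first = integrate(lambda x: x * normal_ordinate(x, N_TOTAL, SIGMA), -LIMIT, LIMIT)
second = integrate(lambda x: x * x * normal_ordinate(x, N_TOTAL, SIGMA), -LIMIT, LIMIT)

mu_1 = first / area     # should be 0
mu_2 = second / area    # should be sigma squared
```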


Section 26. CERTAIN PROPERTIES OF THE NORMAL 
DISTRIBUTION 


The first derivative of equation [44] is:

$$\frac{dy}{dx} = \frac{-Nx}{\sigma^3\sqrt{2\pi}}\,e^{\frac{-x^2}{2\sigma^2}}$$

and, as the mode of derivation necessitated, it has a maximum
at the mean ($x = 0$) and a zero slope at the extremes ($y = 0$).
The second derivative is:

$$\frac{d^2y}{dx^2} = \frac{-N}{\sigma^3\sqrt{2\pi}}\,e^{\frac{-x^2}{2\sigma^2}}\left(1 - \frac{x^2}{\sigma^2}\right)$$

This is zero when x equals plus or minus $\sigma$, so that the points of
inflection of the normal probability curve are at points one
standard deviation above and below the mean.



The first moment, $\mu_1$, for the entire curve is of necessity zero
as deviations are measured from the mean, but if the first
moment from the mean for half the curve is found it will
give the average or mean deviation:

$$\text{M. Dev.} = \frac{2}{N}\int_{0}^{\infty} yx\,dx = \sigma\sqrt{\frac{2}{\pi}} = .7979\,\sigma$$

It is thus found that the average or mean deviation is .7979
times the standard deviation.

$$\text{M. Dev., or Av. Dev.} = .7979\,\sigma \qquad \text{(Relation between average deviation and standard deviation in case of a normal distribution)} \ldots [48]$$


It is frequently desirable to know how far out, in both 
directions, it is necessary to go to secure one half the total 
frequency. This distance is called the probable error because 
of the fact that if the distribution is one of magnitudes varying 
by chance from some one magnitude (the mean) then the 
chances are one to one that any single measure will vary from 
this magnitude by an amount as great as the probable error. 

The area under the curve is given by the integral, $\int y\,dx$.
Therefore if the equation

$$\frac{N}{4} = \int_{0}^{x} y\,dx$$

could be solved for x, it would give that distance which if
measured in each direction from the mean would include one
half the total population. The integral desired may be expanded
into the following convergent series:

$$\int_{0}^{x} y\,dx = \frac{N}{\sqrt{\pi}}\left[\frac{x}{\sigma\sqrt{2}} - \frac{1}{3}\left(\frac{x}{\sigma\sqrt{2}}\right)^3 + \frac{1}{5\cdot 2!}\left(\frac{x}{\sigma\sqrt{2}}\right)^5 - \frac{1}{7\cdot 3!}\left(\frac{x}{\sigma\sqrt{2}}\right)^7 + \cdots\right] \ldots [49]$$

Setting this equal to .25 N, the number of cases between the
mean and plus one probable error, and solving for x gives
.6744898 $\sigma$, the value of the probable error.
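The solution for the probable error can be reproduced by summing the series for the area and locating, by bisection, the deviation at which the area from the mean equals one quarter of the total. The sketch below takes N = 1; the number of series terms, the bisection bounds and the function names are assumptions for the illustration.

```python
import math


def area_from_mean(x, sigma=1.0, terms=40):
    """Series for the area between the mean and a deviation x, with N = 1.

    Term k is (-1)^k * u^(2k+1) / ((2k+1) * k!), where u = x / (sigma * sqrt 2),
    and the whole sum is divided by sqrt(pi).
    """
    u = x / (sigma * math.sqrt(2.0))
    total = 0.0
    for k in range(terms):
        total += (-1) ** k * u ** (2 * k + 1) / ((2 * k + 1) * math.factorial(k))
    return total / math.sqrt(math.pi)


def probable_error(sigma=1.0):
    """Bisection for the deviation whose area from the mean is one quarter."""
    lo, hi = 0.0, 2.0 * sigma
    for _ in range(60):
        mid = (lo + hi) / 2.0
        if area_from_mean(mid, sigma) < 0.25:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


pe = probable_error()           # in units of sigma
area_at_one_sigma = area_from_mean(1.0)
```

The same function gives the area from the mean to any deviate, so that it also serves as a check upon tabled values of the probability integral for moderate deviates.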




Section 27. KELLEY-WOOD TABLE OF THE NORMAL
PROBABILITY INTEGRAL


The upper limit, x, of the integral, $I = \int_0^x z\,dx$, when $N = 1$
and $\sigma = 1$, has been evaluated for values of the area, I, by
.001's, from .000 to .499 and is tabled in the K-W table,*
given in the last pages of this text. The argument for the
table is either I, the area from the mean on to the stump of
the distribution, q, the area of the smaller portion cut off,

or p, the area of the larger portion. I in this table equals $\frac{1}{2}\alpha$
of Sheppard's tables, but whereas the tabulated entry in
Sheppard's most extensive table is $\frac{1}{2}(1 + \alpha)$ and the argument is x, here
the tabulated entry is x and the argument I. In both tables
the ordinate is a tabled entry. The two tables supplement 
each other. Sheppard’s tables will be found the more con- 
venient to use if deviates are known and either areas or ordinates 
desired, while the K—W table will prove the more serviceable 
if areas are known and deviates or ordinates desired. For 
expressing a distribution composed of categories arranged in 
a rank order and having varying frequencies, in terms of a 
normal distribution, the K—W table is much the more service- 
able. Continual reference to Table K—W is made in subsequent 
chapters of this text and if the meaning of I, q, p, x and z are
definitely fixed in mind it will greatly assist in the understand- 
ing of subsequent derivations and formulas (cf. pages 371-383). 

* The table is called the Kelley-Wood, or K-W, table because Dr. Ben D. Wood calculated
by interpolation, using third and fourth order differences, from Sheppard's tables,
values of the abscissa x corresponding to areas from I = .000 to I = .400; because my wife
calculated, by formula [49], values at decreasing intervals from I = .400 to I = .4990, and
because I calculated by interpolation certain values of the deviate from I = .400 to I = .499
and also calculated either by interpolation or by the aid of eight place logarithms, values
of the ordinate, z. The labor has been substantial and I commend to the inquisitive the
calculation of the deviate for I = .499, which Mrs. Kelley determined to be equal to
3.09022850+.

Columns J, x and g constitute the basic table of the probability integral, but the added 
columns z/q, z/p and pq, also calculated by Mrs. Kelley, will be found serviceable in many 
formulas. 


The last figure of the entries in the basic table may be expected occasionally to be in 
error by 1.—T.L. K. 




Section 28. FURTHER PROPERTIES OF THE NORMAL 
DISTRIBUTION 


The probable error was found by means of formula [49].

$$\text{P. E.} = .6744898\,\sigma \qquad \text{(Probable error of any magnitude in terms of the standard deviation or standard error of the magnitude)} \ldots [50]$$

It is to be noted that the probable error is defined as a certain
fixed fraction of the standard deviation, or standard error.
The relationship that half the population lies between plus
and minus .67449 $\sigma$, is strictly true only in case of a normal
distribution; however it is the customary measure to use 
whenever thinking of chance variations, whether the distribu- 
tion under consideration is normal or not. It must be defi- 
nitely kept in mind that the P. E. has no status or means of 
calculation independent of the standard error; it is simply a 
measure of deviation .67449 times as large as the standard 
deviation and should not be confused with the quartile devia- 
tion which, regardless of the shape of the distribution, is one 
half the distance from the lower to the upper quartile. From 
the lower quartile to the upper quartile is always a distance of 
2Q and is a range that always contains just one half the 
measures, whereas from 1 P.E. below the mean to 1 P.E. 
above is a range that contains exactly one half the measures 
only in the special case when the distribution is normal. It is 
to be expected that distributions of measures which are com- 
posite measures based upon a large number of separate scores 
will in general more closely approximate a normal distribution 
than do the distributions of separate scores themselves,* 
so that the error introduced in thinking of 50 per cent of the 
cases as lying between + 1 P. E. and — 1 P.E. is very small, 
if the P. E. under consideration is that of any average, of any 
coefficient of correlation, of any measure of dispersion, or in 
fact of any measure whatever derived from a large number of 
other measures. Quite substantial error may, however, be 
introduced if the P. E. of the distribution of original measures 
is taken as such that 50 per cent of the cases lie between 


* I have not proven this analytically but have found it to be true with many distributions
with which I have had to deal. T. L. K.




+1P.E. and —1 P.E. (See problems 2 and 3 at end of 
chapter.) 
Certain important relations between the moments of the
normal distribution exist. The third moment, $\mu_3 = \frac{1}{N}\int_{-\infty}^{\infty} yx^3\,dx$,
of course, equals zero as the curve is symmetrical with respect
to its origin, the mean.

For the fourth moment we have:

$$\mu_4 = \frac{1}{N}\int_{-\infty}^{\infty} yx^4\,dx = 3\sigma^4 = 3\mu_2^2 \ldots [51]$$


These last two relationships are important in that they provide
a means of determining how closely given data fit a normal
distribution. If $\mu_3 = 0$ and $\mu_4 = 3\mu_2^2$ the fit is entirely satisfactory
and the normal curve will better fit the data than any
other uni-modal curve. If these two relationships do not
exactly hold, the significance of the discrepancy can be determined
by the formulas giving probable errors of any moments,
given in the preceding chapter, or more nearly by determining
the values and probable errors of two constants $\beta_1$ and $\beta_2$.
These are used in all curve fitting following Pearson's method,
and are defined by the equations:

$$\beta_1 = \frac{\mu_3^2}{\mu_2^3}, \qquad \beta_2 = \frac{\mu_4}{\mu_2^2} \qquad \text{[Formulas 69 and 70 of Sec. 36]}$$

For a normal distribution $\beta_1 = 0$ and $\beta_2 = 3$. The probable
errors of $\beta_1$ and $\beta_2$ may be found from Tables 37 and 38 of
Pearson's Tables. If for any distribution the obtained $\beta$'s
differ from 0 and 3 respectively by amounts which are small
with reference to their probable errors the data may be considered
normal. The probable errors of these $\beta$'s will be found
to be large if the populations are small. This is simply indica- 
tive of the fact that it is impossible to determine the type of a 
distribution from a small population and it is scarcely worth 
attempting unless the population is over 100. 
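The two constants are readily computed from any set of measures. The sketch below uses Pearson's definitions, $\beta_1 = \mu_3^2/\mu_2^3$ and $\beta_2 = \mu_4/\mu_2^2$, and applies them to two simulated samples, one normal and one decidedly skew. The function names, the seed and the simulated data are assumptions for the illustration; the probable errors of the $\beta$'s are not computed here.

```python
import random


def central_moment(xs, k):
    """k-th moment about the sample mean."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** k for x in xs) / len(xs)


def pearson_betas(xs):
    """Return (beta_1, beta_2) = (mu_3^2 / mu_2^3, mu_4 / mu_2^2)."""
    mu2 = central_moment(xs, 2)
    mu3 = central_moment(xs, 3)
    mu4 = central_moment(xs, 4)
    return mu3 ** 2 / mu2 ** 3, mu4 / mu2 ** 2


random.seed(7)
normal_like = [random.gauss(0.0, 1.0) for _ in range(50000)]   # hypothetical normal data
skewed = [random.expovariate(1.0) for _ in range(50000)]       # hypothetical skew data

b1_normal, b2_normal = pearson_betas(normal_like)
b1_skew, b2_skew = pearson_betas(skewed)
```

For the normal sample the values lie near 0 and 3; for the skew sample both depart widely from those figures. With small populations the same computation gives values so variable that, as stated above, little can be inferred about the type of the distribution.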


Section 29. PROPERTIES OF PORTIONS OF A NORMAL
DISTRIBUTION
The method followed in the calculation of the average
deviation is serviceable in determining the mean deviation of
any tail of a normal distribution. Let a "unit normal distribution"
be one of standard deviation and population each equal
to 1, then the mean deviation from the mean, of the tail of a
normal distribution covering the portion from x to $\infty$ is given
by the equation:

$$\text{M. D. of tail} = \frac{\int_x^\infty yx\,dx}{\int_x^\infty y\,dx} = \frac{\sigma^2 y_x}{N_x} = \frac{\sigma z_x}{q_x} \qquad \text{(Mean deviation of the tail of a normal distribution)} \ldots [52]$$

in which $y_x$ is the ordinate per unit base at the point of truncation;
$N_x$ is the number of cases lying beyond this point;
$z_x$ is the value of the ordinate of a unit normal curve at the
stump or point of truncation x, and $q_x$ is the number of cases
in the unit normal distribution from the point of truncation
x on to $\infty$. In case of a unit normal distribution we have:

$$\text{M. D. of tail} = \frac{z}{q} \qquad \text{(Mean deviation of the tail of a unit normal distribution)} \ldots [53]$$

This magnitude, z/q, is given in Table K-W. In case q < .5
use column "z/q" and in case q > .5 use column "z/p".

This relationship between ordinate and mean deviation of 
tail is one of the unique and very interesting properties of the 
normal distribution. It has many applications, one of which 
is considered herewith. In case the tail is one half the curve
we have: $.7979\,\sigma = \frac{2\sigma^2 y_0}{N}$, in which $y_0$ is the ordinate per unit
base interval at the mean. Solving for $\sigma$ gives, approximately,

$$\sigma = \frac{.4N}{y_0} \qquad \text{(Formula for roughly determining the standard deviation of a distribution which is approximately normal)} \ldots [54]$$


Accordingly, if a rough estimate of the standard deviation of 
a distribution will suffice, it may be obtained by dividing .4 
of the total population by an estimate of the height of the 
ordinate, at the mean, of the normal curve which would best 
fit the data. 

A simple extension of the method followed in obtaining the
mean deviation of the tail will give the mean deviation from
the mean of any part of the distribution. Consider the standard
deviation and area of the following figure to be 1 and let
it be required to find the mean deviation, from the mean of
the entire distribution, of that part of the distribution between
$x_1$ and $x_2$. Let the ordinates at these points be $z_1$ and $z_2$. Let
$q_1$ and $q_2$ be the proportions of the population lying above $x_1$
and $x_2$ respectively. Let $d$ = the required mean deviation;
$d_1$ = the mean deviation of the tail from $x_1$ on; $d_2$ = the mean
deviation of the tail from $x_2$ on. Then $(q_1 - q_2)$ is the proportion
lying in the interval from $x_1$ to $x_2$.

[Figure: unit normal curve with the portion between $x_1$ and $x_2$ marked.]

The first moment
of the distribution beyond $x_1$ is equal to the first moment of
that part between $x_1$ and $x_2$ plus the first moment of that
part beyond $x_2$, or

$$q_1 d_1 = (q_1 - q_2)\,d + q_2 d_2$$

That is, since by [53] $q_1 d_1 = z_1$ and $q_2 d_2 = z_2$,

$$z_1 = (q_1 - q_2)\,d + z_2$$

and solving,

$$d = \frac{z_1 - z_2}{q_1 - q_2} \qquad \text{(Mean deviation of a portion of a unit normal distribution)} \ldots [55]$$

The magnitudes $q_1$ and $q_2$ are the proportions lying beyond the
lower and upper limits respectively of the class involved, and
$z_1$ and $z_2$ are the ordinates for these proportions as given in
Table K-W.

As an illustration the following problem is given. Assuming 
a normal distribution, express the following school marks as 
deviations from the mean: 


MARKS    PER CENT     q_1      q_2      z_1        z_2        d = (z_1 - z_2)/(q_1 - q_2)
         INDICATED
A        11.4         .114     .000     .192900    .000000      1.692
B        34.7         .461     .114     .397034    .192900       .588
C        32.5         .786     .461     .291399    .397034     - .325
D        10.2         .888     .786     .190478    .291399     - .989
E         9.0         .978     .888     .052485    .190478     -1.533
F         2.2        1.000     .978     .000000    .052485     -2.386




The table informs us that a mark of A is equivalent to a posi- 
tion 1.692 standard deviations above the mean of the group, 
that a grade of B is .588 standard deviations above the mean, 
a grade of C is .325 standard deviations below the mean, etc. 
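The conversion of marks to deviates by formula [55] can be carried out without the printed table. In the sketch below Python's `statistics.NormalDist` stands in for Table K-W; the function names and the way the cumulative proportions are built up from the percentages are assumptions for the illustration.

```python
from statistics import NormalDist

UNIT = NormalDist()   # unit normal distribution: mean 0, standard deviation 1


def ordinate_for_tail(q):
    """Ordinate z of the unit normal curve at the point above which a proportion q lies."""
    if q <= 0.0 or q >= 1.0:
        return 0.0                      # the curve has zero height at either extreme
    return UNIT.pdf(UNIT.inv_cdf(1.0 - q))


def class_mean_deviation(q1, q2):
    """Formula [55]: d = (z1 - z2) / (q1 - q2)."""
    return (ordinate_for_tail(q1) - ordinate_for_tail(q2)) / (q1 - q2)


# Per cent of pupils receiving each mark, from the highest mark down.
percents = {"A": 11.4, "B": 34.7, "C": 32.5, "D": 10.2, "E": 9.0, "F": 2.2}

deviations = {}
above = 0.0
for mark, pct in percents.items():
    q2 = above                              # proportion above the upper limit of the class
    q1 = round(above + pct / 100.0, 6)      # proportion above the lower limit of the class
    deviations[mark] = class_mean_deviation(q1, q2)
    above = q1

for mark, d in deviations.items():
    print(f"{mark}: {d:+.3f} standard deviations from the mean")
```

The same routine serves for any set of ordered categories with known percentages, which is the use of the K-W table described in Section 27.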

The standard deviation of a portion of a normal distribution 
is developed in Section 60 in connection with another problem, 
— see formula [188]. 


Section 30. THE PROBABILITY OF EXCEEDING A GIVEN
DIVERGENCE


The normal curve assists in establishing the degree of con- 
fidence which may be placed in statistical findings. The 
significance of any measure is to be judged by comparison 
with its probable error. If a child makes a score of 80 on a 
certain test and if the probable error of the score is 5, we may 
estimate the chances of the child’s true ability being as much as 
100. We assume that the distribution of the child’s perform- 
ances would follow a normal curve. Note that the assumption 
is not that the talents of children in general follow a normal 
distribution. This latter might be less reasonable than the 
one we are called upon to make. Moreover, so little differ- 
ence in probabilities, except for extreme deviates, is ordinarily 
consequent to differences in forms of distribution, that the 
assumption of normality is little likely to result in serious 
error for such problems as the present one. For extreme 
deviates it generally does not matter so far as any practical 
deductions are concerned whether the chances are 1 in 1000 
or ten times as great. Nor for smaller deviates does it make 
any particular difference whether the chances are 400 in 1000 
or 410 in 1000. Should such differences as mentioned be 
significant in any particular problem, no assumption should 
be made, but the type of the curve should be experimentally 
determined. 

For the problem in hand: If the P. E. is 5 the standard error
is $\left(\frac{5}{.6745}\right) = 7.413$. The difference between the scores that
we are concerned with is (100 - 80) = 20, which is $\left(\frac{20}{7.413}\right) =$
2.698 standard errors. The K-W Table, or more conveniently
for this problem Sheppard's Tables, may be used to find the
area in the tail below the point which is 2.698 standard deviations
below the mean. The tables give .0035. To interpret
this we should postulate the person's true ability as being 100
and his various performances distributing themselves in a
normal distribution, with standard deviation equal to 7.413
around this mean. Then .0035 of the area of the curve will
lie below the point 80. Accordingly if his true ability is 100,
only 35 times in 10000, or 3.5 times in 1000, would a score as
low or lower than 80 be expected. With such figures a person
could accept the proposition that the child’s ability was not 
as great as 100 with about as much certainty as he can start 
across a business street expecting not to be hit by an auto- 
mobile. It is, in other words, just such a conclusion as one is 
justified in acting upon. 
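The steps of this computation (probable error to standard error, difference to standard-error units, deviate to tail area) may be sketched as follows. Python's `statistics.NormalDist` is used in place of the tables; the function name and the constant name are assumptions for the illustration.

```python
from statistics import NormalDist

PE_FACTOR = 0.6745   # probable error expressed as a fraction of the standard error


def chance_of_score_at_or_below(observed, true_value, probable_error):
    """Chance of a score as low as `observed` if the true ability is `true_value`.

    Performances are assumed to be normally distributed about the true value
    with standard error = probable error / .6745.
    """
    standard_error = probable_error / PE_FACTOR
    return NormalDist(mu=true_value, sigma=standard_error).cdf(observed)


standard_error = 5 / PE_FACTOR                 # about 7.413
deviate = (100 - 80) / standard_error          # about 2.698 standard errors
chance = chance_of_score_at_or_below(80, 100, 5)
```

The result is about 35 chances in 10000, agreeing with the tabled area.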

Table K—W is built upon the basis of the standard deviation 
as the unit of variability, instead of the probable error. If 
probable errors instead of standard errors are known, the 
following table may be used for rough work, thus avoiding the 
labor of division by .6745: 

TABLE XXVI

The Likelihood of a Difference as Great as this Obtained One

DIFFERENCE,       and in the same direction,            and in the same or the opposite
IN MULTIPLES      is 100 p in 100, or 100 p             direction, is 2 × 100 p in 100,
OF ITS            chances of its occurring to           or 200 p chances of its occurring
PROBABLE          100 q chances of its not              to 100 (1 − 2p) chances of its
ERROR             occurring                             not occurring

                  100 p in 100      100 p to 100 q      200 p in 100      200 p to 100 (1 − 2p)
    .5             37 in 100         37 to 63            74 in 100         74 to 26
   1.0             25 in 100         25 to 75            50 in 100         50 to 50
   1.5             16 in 100         16 to 84            31 in 100         31 to 69
   2.0              9 in 100          9 to 91            18 in 100         18 to 82
   2.5              5 in 100          5 to 95             9 in 100          9 to 91
   3.0              2 in 100          2 to 98             4 in 100          4 to 96
   3.5              1 in 100          1 to 99             2 in 100          2 to 98
   4.0             .3 in 100.0                           .7 in 100.0
   5.0            .02 in 100.00                         .04 in 100.00
   6.0           .001 in 100.000                       .003 in 100.000
   7.0          .0001 in 100.0000                     .0001 in 100.0000
   8.0        .000001 in 100.000000                 .000003 in 100.000000
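
For rough work of the kind this table serves, the chances may also be computed directly. The following Python sketch is an illustration only; the function name `chances_in_100` is hypothetical, and the normal curve of the standard library is assumed in place of the printed tables. It is run here for multiples of .5 to 4.0 of the probable error.

```python
from statistics import NormalDist

PE_TO_SIGMA = 0.6745   # one probable error expressed in standard errors

def chances_in_100(multiple_of_pe):
    """Return (same direction, either direction) chances in 100 of a
    difference at least this many times its probable error."""
    z = multiple_of_pe * PE_TO_SIGMA
    p = 1.0 - NormalDist().cdf(z)     # tail beyond z in one direction
    return 100 * p, 200 * p

for k in (0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0):
    one, either = chances_in_100(k)
    print(f"{k:3.1f}   {one:5.1f} in 100   {either:5.1f} in 100")
```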




Section 31. SUMMARY OF FACTS CONCERNING THE NORMAL
DISTRIBUTION


A summary of the facts already discovered together with a 
few determined later in regard to the normal probability curve 
gives the following: 

1. It is uni-modal, symmetrical with respect to the mean, and
is completely determined when N, the population, M, the mean,
and σ, the standard deviation of the distribution, are known.

2. The mean, median, and mode coincide. 

3. Measures of dispersion are related in the following 
ways: 

Q     = P. E.        = .84535 A. D.  = .67449 σ      = .26315 D
A. D. = 1.1829 Q     = 1.1829 P. E.  = .79788 σ      = .31129 D
σ     = 1.4826 Q     = 1.4826 P. E.  = 1.2533 A. D.  = .39015 D
D     = 3.8001 Q     = 3.8001 P. E.  = 3.2124 A. D.  = 2.5631 σ
                                                     . . . [56]

The range covered by the measures is approximately,
   In case the total population is   10, = 4 σ
    "   "   "    "       "      "    50, = 5 σ
    "   "   "    "       "      "   200, = 6 σ
    "   "   "    "       "      "  1000, = 7 σ

σ = approximately .4 N ÷ the height of the smoothed ordinate
at the mean, median, or mode.

4. The points of inflection of the curve are at distances 1 σ
and − 1 σ from the mean.

5. Every odd moment μ1, μ3, μ5 · · · of the curve is equal to 0.
The even moments are given by

μ2 = σ²
μ4 = 3 μ2² = 3 σ⁴
μ6 = 15 μ2³ = 15 σ⁶
μ8 = 105 μ2⁴ = 105 σ⁸
β1 = 0, and β2 = 3.


6. The mean deviation of a truncated portion of the curve, 
taken from the mean of the entire distribution, is equal to the 
square of the standard deviation of the entire distribution into 
the height of the curve at the point of truncation, divided by 
the number of cases in the tail. 




7. The most reliable constant of the distribution is the
standard deviation. Its probable error =

.6744898 σ / √(2N), or .477 / √N of its own magnitude . . . [58]

This follows from formulas [32-a] and [50].
The probable error of the average deviation =

.4066 σ / √N, or .510 / √N of its own magnitude . . . [59]

The probable error of D, the 10-90 percentile range, =

1.5373 σ / √N, or .600 / √N of its own magnitude . . . [16 a]

The probable error of the quartile =

.5306 σ / √N, or .787 / √N of its own magnitude . . . [60]

This follows from formulas [14] and [50].

It is thus seen that if N measures result in a certain reliability
in the standard deviation, an equal reliability requires
1.14 N measures in the average deviation, 1.58 N
measures in the 10-90 percentile range, and 2.72 N measures
in the quartile deviation.

8. Measures of central tendency are less reliable than 
measures of dispersion. Little, if any, significance attaches 
to a measure of the unreliability of an average expressed in 
terms of itself, and, furthermore, since in the normal distribu- 
tion all measures of central tendency coincide, it will suffice for 
purposes of comparison to give the probable error of each. 


P. E. of mean = .6745 σ / √N (Normal or any other distribution) . . . [61]

P. E. of median = .84535 σ / √N (In case of normal distribution) . . . [62]


P. E. of the mode is unknown unless the mode is determined 
from the equation which best fits the data, in which case its 
probable error compares favorably with those of the mean 
and median. 

It is seen that if N measures result in a certain reliability in 
the mean, it requires 1.57 N measures to obtain an equal 
reliability in the median. 




9. If a distribution is normal the most reliable measure of 
dispersion based upon percentiles is that between the 7th and 
93d percentiles. Of almost as great reliability is the 10-90 
percentile range. 

10. The distribution of frequencies in the point binomial
(p + q)^n closely approximates a normal distribution if n is
large and neither p nor q very small. For n infinite and
neither p nor q infinitesimal the point binomial distribution
becomes a point normal distribution.

11. The average deviation from the mean of any portion of
a normal distribution may be obtained from the equation:

d = (z1 − z2) / (q1 − q2)

in which the q’s are proportions of the population and the z’s
are corresponding ordinates as given in Table K-W.

12. The standard deviation from the mean of any portion of
a normal distribution may be obtained from the equation:

σ² (of the portion) = 1 + (x1 z1 − x2 z2) / (q1 − q2) − d² [Section 63, Formula 188]

13. The equation of the normal distribution is

y = N / (σ √(2π)) · e^(−x² / 2σ²) . . . [44]

or,

y = y0 [1 − (1/2) (x/σ)² + (1/8) (x/σ)⁴ − (1/48) (x/σ)⁶ + · · ·]
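
The comparative reliabilities stated in paragraphs 7, 8 and 9 can be checked numerically. The sketch below is an illustration, not the derivation used in the text: it assumes the usual large-sample formula for the variance of a percentile, 2p(1 − 2p)/(N f²) for the range between the 100p-th and 100(1 − p)-th percentiles with f the unit normal ordinate at either point, and all function names are hypothetical.

```python
from math import pi, sqrt
from statistics import NormalDist

ND = NormalDist()

def rel_se_sd():
    # Standard error of sigma as a fraction of sigma, times sqrt(N).
    return 1 / sqrt(2)

def rel_se_ad():
    # A.D. = sigma*sqrt(2/pi); its standard error = sigma*sqrt(1 - 2/pi)/sqrt(N).
    return sqrt(1 - 2 / pi) / sqrt(2 / pi)

def rel_se_percentile_range(p):
    # Range between the 100p-th and 100(1-p)-th percentiles of a normal curve.
    z = ND.inv_cdf(1 - p)                      # upper point in sigma units
    f = ND.pdf(z)                              # ordinate at either point
    variance = 2 * p * (1 - 2 * p) / f ** 2    # large-sample variance times N
    return sqrt(variance) / (2 * z)            # relative to the range itself

def measures_needed(rel_se):
    # Multiple of N giving the same relative reliability as sigma has with N.
    return (rel_se / rel_se_sd()) ** 2

ratio_ad = measures_needed(rel_se_ad())
ratio_d = measures_needed(rel_se_percentile_range(0.10))
ratio_q = measures_needed(rel_se_percentile_range(0.25))
ratio_median = pi / 2                          # (1.2533)^2, median against mean

# Whole percentile giving the most reliable symmetrical percentile range.
best = min(range(1, 26), key=lambda k: rel_se_percentile_range(k / 100))

print(round(ratio_ad, 2), round(ratio_d, 2), round(ratio_q, 2))
print(round(ratio_median, 2), best)
```

Under these assumptions the sketch returns the multiples 1.14, 1.58 and 2.72 of paragraph 7, the 1.57 of paragraph 8, and the 7th percentile of paragraph 9.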


PROBLEMS 


1. Given a normal distribution with areas and deviations as indicated 
in the accompanying figure, then (1 — a)/1 is the probability of a measure 


lying in the shaded portion or, in other words, of a measure deviating 
from the mean by a distance greater than x. If the probability of a single
measure lying beyond x is this small amount 1 − a, then the probability
of a measure, in case of a population of N measures, lying beyond this 
point is N(1 —a). If this probability, N (1 — a), equals .5, then the 
value x corresponding to the a is such a deviation that the chances that a 
measure will lie beyond the point x is just equal to the chance that no 
measure will lie beyond it. The distance x is therefore the most probable 
maximum deviation which will be found in the case of a population of N. 
As a sufficiently close approximation x may be taken as equal to one half 
the range. Accordingly using Table K—W the following table is obtained: 
                                      N, SUCH THAT
RANGE       x          1 − a         N (1 − a) = .5
 3 σ      1.5 σ       .1336
 4 σ      2.0 σ       .0455                9
 5 σ      2.5 σ       .01242              40
 6 σ      3.0 σ       .00270             185
 7 σ      3.5 σ       .000465           1075
 8 σ      4.0 σ       .0000634


Complete the table, determining values for 3 σ and 8 σ. [Answer: If the
population is 4 (more exactly 3.75) the range of the measures is (providing
the total distribution from which the sample of 4 is drawn is normal) most
probably equal to 3 σ; and if the population is 8660 the range is most
probably equal to 8 σ.]

2. In the case of the distribution of incomes given in Table X calculate 
the L.Q. and the U.Q. and the points corresponding to — P. E. and 
+ P.E. Compare values found. What percentage of the cases lie be- 
tween these + and — P. E. points? 


3. Do the same for the distribution of Wholesale Price Indexes given 
in Table XIV. 


4. Estimate the standard deviation of the distribution of temperatures 
given in Table VIII and Charts I and II by first estimating the height at 
the mean of the normal curve which would seem to fit the data. 

5. Do the same for the College Marks data given in Table XVIII.
Compare σ found with the correct σ.


6. Group the College Marks data in fives, 47-52 constituting one
group, 52-57, the next, etc. Plot and from height of the curve at the
mean, estimate the σ. Compare with correct value. What adjustment
in estimating σ by this short method is necessary in case the data are
grouped? [Answer: The obtained σ is in terms of intervals and must be
multiplied by the number of elementary units in each group to give the σ
expressed in elementary units.]

7. Verify the calculation of equivalent scores given in Table XXXV. 


8. If the plumage of certain fowl is either blue, splashed, or white, 
and if the percentages in these categories are 28, 60, and 12, what numerical 
values should be assigned to these colorations should it be desired to treat 
them as color deviations in a normal distribution? 




9. Assuming normality of distribution in the temperature data, Table
VIII, and using 81.548 and 6.190, the values of the mean and standard 
deviation already found, calculate the ordinate at + 1 P.E., 85.723, and 
compare with the actual ordinate. [Answer: Theoretical 3.17, Actual with- 
out smoothing 3.00.] Still assuming normality, what is the average devia- 
tion from the mean of the truncated portion beyond this point? [Answer: 
7.86.] Of the portion below this point? [Answer: —2.62.] 


10. Verify all statements in paragraph 7, Section 31. 
11. Verify statement in last sentence of paragraph 8, Section 31. 


12. (a) Calculate β1 and β2 for the point binomial when p = q = 1/2
and n = 25. [Answer: β1 = 0, β2 = 2.92.]
(b) Calculate β1 and β2 for the point binomial when p = .1, q = .9
and n = 25. [Answer: β1 = .2844, β2 = 3.204.]
(c) Calculate β1 and β2 for the point binomial when p and q are both
finite and n = ∞. [Answer: β1 = 0, β2 = 3.]
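
The answers to Problem 12 can be checked with the short Python sketch below. It relies on the standard expressions for the point binomial, β1 = (q − p)²/(npq) and β2 = 3 + (1 − 6pq)/(npq), which are assumed here rather than quoted from this chapter; the function name is hypothetical, and a very large n stands in for n = ∞.

```python
def binomial_betas(p, n):
    """Return (beta1, beta2) for the point binomial (p + q)^n."""
    q = 1.0 - p
    npq = n * p * q
    beta1 = (q - p) ** 2 / npq                 # skewness measure
    beta2 = 3.0 + (1.0 - 6.0 * p * q) / npq    # kurtosis measure
    return beta1, beta2

print(binomial_betas(0.5, 25))       # part (a)
print(binomial_betas(0.1, 25))       # part (b)
print(binomial_betas(0.1, 10**9))    # part (c): n very large
```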


CHAPTER VI 
COMPARABLE MEASURES 


Section 32. THE CONDITIONS REQUISITE FOR COMPARISON


In many studies measures of the same, or nearly the same, 
phenomena are obtained and it is desired to compare results. 
Gross measures or scores can with validity be compared 
directly only in case they are in the same units and have been 
obtained under very similar conditions. There are four 
methods in common use, the purpose of each of which is to 
derive comparable measures from original scores obtained in 
such manner as not to be directly comparable. Of these 
four the first and the only one which is universally sound is 
that based upon the complete equivalence of the scales of 
measurement involved; a second is the ratio or index method; 
a third may be called the equivalence of standard measures 
method; and a fourth may be called the equivalence of suc- 
cessive percentiles method. 

The first method presupposes that the complete equivalence 
between measures is known. If both are rectilinear scales 
and two points of the one have been determined to be equivalent 
to two points of the other, then for every point of the one an 
equivalent point on the other may be immediately located. 
As an illustration of this method may be considered the com- 
parison of two heights, one expressed in centimeters and the 
other in inches. In the case of inches and centimeters the 
two points which have been determined as equal are: 


  0.0 centimeter  =  0.0  inch
100.0 centimeters = 39.37 inches


This type of equating is common both in the physical sciences
and in the social sciences, but it should be noted that it is
entirely sound only in case the two scales measure identically
the same thing in the same linear manner. Any number of
functions may be found which agree at two or more points,
but are not identical, such, for example, as, f′ = sin² x; f″ = 2x/π;
etc. For each of these the function equals zero when x equals
zero and the function equals 1 when x equals π/2, but in general
f′ ≠ f″.

The minimum number of conditions which must be met
before two scales can be fully equated is three. The conditions
are, (a) one point of the first must be known to be equal
to a point of the second, (b) a second point of the first must 
be known to be equal to a second point of the second, and 
(c) the law establishing the relationship between successive 
points on the first must be known to be the law underlying the 
second. This third condition is the hardest to establish and 
should be examined the most critically. Even in the physical 
sciences it frequently can only be approximately established. 
Compare, for example, the relation between temperature, 
pressure and volume in the case of two gases. When these 
three conditions are met the determining of equivalent scores 
is simple and is just such a problem as that of finding equivalent 
temperatures in the centigrade scale to those in the Fahrenheit 
scale, knowing that 0° and 100° centigrade correspond to 32°
and 212° Fahrenheit respectively and that both scales are 
rectilinear. 
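
When the three conditions hold, the equating is a matter of simple proportion, as the following Python sketch suggests. The function names are hypothetical. The last lines illustrate why condition (c) matters: two functions chosen here for illustration, sin² x and 2x/π, agree at 0 and at π/2 and yet differ at a point between.

```python
from math import pi, sin

def equate_linear(x, a1, b1, a2, b2):
    """Map x from scale 1 to scale 2, given that a1 is equivalent to a2 and
    b1 to b2, and assuming both scales are rectilinear (condition c)."""
    return a2 + (x - a1) * (b2 - a2) / (b1 - a1)

def centigrade_to_fahrenheit(c):
    # 0 and 100 centigrade correspond to 32 and 212 Fahrenheit.
    return equate_linear(c, 0.0, 100.0, 32.0, 212.0)

print(centigrade_to_fahrenheit(37.0))

# Two scales agreeing at two points need not agree elsewhere.
x = pi / 6
print(sin(x) ** 2, 2 * x / pi)
```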

It frequently happens that only two of the three conditions 
mentioned are established, in which case a guess is sometimes 
made as to the third and an equating attempted. The excel- 
lence of the resulting system of equivalent measures is un- 
certain, and all interpretations drawn should be with the reser- 
vation that they are subject to the validity of the assumption 
involved. 


Section 33. THE RATIO METHOD

In case conditions (a) and (b) are met, and condition (a) is
“a score of zero on the one scale is equal to a score of zero on
the other,” condition (c) is frequently assumed to be “the same
proportion between the units of the two scales is maintained
throughout.” With these underlying conditions the ratio
method is frequently used. Illustrations will show the hazards
involved. Given the following sets of data:


TABLE XXVII

                          HEIGHT IN CM.    WEIGHT IN LBS.
Individual A . . . . .        138                75
Average adult  . . . .        172               145

(Data for individual A are those given in Whipple for the average 12.0
year old boy.)


TABLE XXVIII

WEIGHT
Elephant A . . . . . .  4000 pounds      Butterfly B . . . . . .  2 grams
Average for species  .  3600 pounds      Average for species . .  1 gram


TABLE XXIX
United States Bureau of Labor Statistics: Average Aug. 15 Retail Prices

                        FRESH EGGS    POTATOES     BREAD        TEA
1918 . . . . . . . .    53.6¢ doz.    3.9¢ pd.    9.9¢ pd.    65.8¢ pd.
Average 1913-17  . .    35.8          2.2         7.3         55.7


If one is attempting to secure a maturity measure based 
upon height and another based upon weight one might start 
with the following propositions: 

(a) 0 cm. height indicates the same amount of maturity as 0
pounds weight, (b) 172 cm. height indicates the same amount
of maturity as 145 pounds weight, (c) the law of development
of height is the same as that for weight. Of these three statements
(a) is probably entirely sound, (b) probably tolerably
satisfactory, particularly if dealing with groups and averages,
while (c) is probably quite absurd. Accepting these three
propositions is equivalent to saying that scores X1 and X2 in
the two measures, which satisfy the following equation, in
which M1 and M2 are the means of the two series, are equivalent:

X1 / M1 = X2 / M2

The ratio is often used with some other magnitude than the
mean as a base so that a more general statement of the equation
connecting equivalent scores is:

X1 / B1 = X2 / B2 (Equivalent scores upon the assumption of equality of ratios) . . . [63]


B1 and B2 should be values of the variables which are known
with more than usual certainty to be comparable and reliable.
It is also desirable that they be not small with reference to the
scores involved. Due to the greater reliability of means than
of individual scores the use of the mean as a base has much to
recommend it. Letting σ1 and σ2 stand for the standard
deviations of the X1 and X2 scores, one criterion of the soundness
of the assumption of the equality of ratios is:

B1 / σ1 = B2 / σ2 (Criterion to use in judging of the appropriateness of the ratio method) . . . [64]


The use of this criterion is illustrated in the next section in a 
problem in which the bases are the means. 

The calculated ratio scores of Individual A are not equal, for
A stands (138/172 =) .802 on the height maturity scale and
(75/145 =) .517 on the weight maturity scale. Accepting
proposition (c) one would conclude that individual A is a very
abnormal person, being some 28.5 per cent more developed in
height than in weight. In dealing with mental traits not
amenable to direct observation a conclusion as absurd
as that just drawn might pass for years without discovery.
In the case of height and weight the fallacy can be immediately
detected and a method followed which will be more reasonable,
though it is impossible to say that it is entirely sound, as the
proposition (c) is still an assumption.

Height being a one-dimensional magnitude and weight approximately
three-dimensional, (a) and (b) stand as before and
the third becomes: (c) The law of development of height is
the same as that for the cube root of weight. The comparisons
then are: Maturity index based upon height = 138/172 = .802.
Maturity index based upon weight = ∛75 / ∛145 = .803. Upon the
basis of these two figures one would conclude that the individual
is practically equally developed in the two traits. This illustration is
given to show the material differences which result from
different assumptions as to the laws connecting successive
scores of two scales and not to suggest that either of the two 
methods followed is established as sound. At best, in the 
problem in question, propositions (b) and (c) are questionable. 
Logically proposition (a) seems sound, but there are many 
situations in psychology and economics where a similar state- 
ment would be very fallacious. 
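
The two sets of maturity indexes for Individual A can be recomputed with the Python sketch below, given only as an illustration of how much the assumed law of development changes the result. The variable names are hypothetical; the figures are those of Table XXVII.

```python
# Individual A and the adult averages of Table XXVII.
height_a, weight_a = 138.0, 75.0
height_adult, weight_adult = 172.0, 145.0

index_height = height_a / height_adult
# First assumption: weight develops by the same law as height.
index_weight = weight_a / weight_adult
# Second assumption: the cube root of weight develops as height does.
index_weight_cube_root = (weight_a / weight_adult) ** (1 / 3)

print(round(index_height, 3), round(index_weight, 3),
      round(index_weight_cube_root, 3))
# prints: 0.802 0.517 0.803
```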

The hazards of the ratio method are not lessened when 
dealing with the same sort of function of different things. For 
example, the weight of one child expressed as a proportion of 
the average adult weight in comparison with the weight of a 
second similarly expressed may be very misleading. The two 
children may have very different hereditary endowments, the 
one becoming a normal adult of weight 120 pounds and the 
other a normal adult of weight 145 pounds. The fallacy in 
using indexes in the case just mentioned is the same as that 
for Table XXVIII. Elephant A has a weight index of 1.11 
and Butterfly B one of 2.00. This constitutes no proof that 
as a butterfly B is more exceptional than is A as an elephant. 
It might be true that 10 per cent of butterflies exceed 3 grams
in weight and but 5 per cent of elephants exceed 4000 pounds.
The indexes do not tell us, but in such case it would seem 
reasonable to call A the more exceptional. 

Using the Labor Bureau data of Table XXIX we find that 
the 1918 August 15 price of fresh eggs is 150 per cent of the 
average August 15 price for the years 1913-17; of potatoes 
177 per cent; of bread 136 per cent; and of tea 118 per cent. 
These four ratios tell an important story, but at the same time 
they may be misleading and for the same reason that the weight 
ratios of elephants and butterflies are misleading. The law 
covering the fluctuation of potato prices is almost certainly 
different from that covering the fluctuation of bread prices 
and similarly for any two of the products which may be com- 
pared. Conditions (a) and (b) may be fairly sound, but very 
questionably so of condition (c): 

(a) 0 ¢ per dozen eggs indicates the same sort of a price condition
as 0 ¢ per pound for potatoes.

(b) 35.8 ¢ per doz. eggs indicates the same sort of a price
condition as 2.2 ¢ per pound for potatoes.

(c) The conditions determining the fluctuations in the prices 
of eggs are proportional to those determining fluctua- 
tions in potato prices. 


Because of the peculiar difficulty of establishing condition (c) 
the ratio method for economic and psychological problems 
may be expected to be an artifact and not an exact quantitative 
procedure. 
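
The four price ratios quoted above follow from the figures of Table XXIX, as the Python sketch below shows. It is an illustration only; the dictionary names are hypothetical and the prices are entered as read from the table, in cents.

```python
# Average Aug. 15 retail prices in cents, as read from Table XXIX.
prices_1918 = {"fresh eggs": 53.6, "potatoes": 3.9, "bread": 9.9, "tea": 65.8}
base_1913_17 = {"fresh eggs": 35.8, "potatoes": 2.2, "bread": 7.3, "tea": 55.7}

# Each 1918 price as a percentage of its 1913-17 average.
relatives = {item: 100 * prices_1918[item] / base_1913_17[item]
             for item in prices_1918}

for item, value in relatives.items():
    print(f"{item:11s} {value:5.0f} per cent")
```

The ratios are simple to obtain; whether they are comparable from one commodity to another still rests on condition (c).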

A part of the error involved in combining price ratios of 
separate items to obtain a general index may be eliminated by 
weighting the separate ratios inversely as the squares of their 
variabilities, as proven in Section 91 and illustrated in Section
90. This method, however, will not result in as great
accuracy as will one based upon the multiple correlation and 
regression of the prices involved. Further considerations are 
given in Chapter XIII. 


Section 34. THE STANDARD MEASURE METHOD 


This is an outgrowth of the method used by Francis Galton. 
It has certain refinements in the measures involved, but rests 
upon practically the same principle. Galton considered two
measures which attempted to measure the same function to 
be comparable when each was expressed as a deviation from 
the median of the group to which it belonged and when each 
such deviation was divided by the quartile deviation of the 
group. The three propositions essential to the soundness of 
this procedure are: 


(a) The median score of the first measure indicates the 
same sort of a condition as the median score of the 
second measure. 


(b) A score of the first measure which deviates one quartile 
from the median indicates the same sort of a condition 
as a score of the second which deviates in the same 
direction one quartile from its median. 

(c) In general, deviations of the two measures which are in 
the same proportion as the quartile deviations are 
indicative of the same sort of a condition. 




More briefly stated these propositions are:
(a) Median scores are comparable.
(b) Quartile deviations are comparable.
(c) The same proportion as between quartile deviations holds for all
equivalent deviations from the medians.


Since the mean can generally be more reliably determined 
than the median, and the standard deviation than the quartile 
deviation, the Galton procedure has been dropped and the 
following propositions taken as a basis: 


(a) Mean scores are comparable. 
(b) Standard deviations are comparable. 


(c) The same proportion as between standard deviations 
holds for all equivalent deviations from the mean. 


Let

z1 = (X1 − M1) / σ1, and z2 = (X2 − M2) / σ2 (Standard measures) . . . [65]

Then the measures to be compared are z1 and z2. Such measures
as these may be called “standard measures” as they are measures
of deviation expressed in terms of standard deviations.
The last proposition may then be stated:


(c) Equal standard measures are comparable. 


It should be noted that there is no implication that a zero 
score in the first measure is equal to a zero score in the second 
measure. Proposition (c) always needs experimental verifica- 
tion, but for the usual distributions found in the social sciences 
it seems reasonable to expect that if the means of the distribu- 
tions are set equal, and if points one standard deviation away 
from the respective means be placed together, a better ap- 
proximation to complete equivalence throughout the entire 
scales will be obtained than if the means and zero points are 
equated and other values taken in proportion. The following 
data taken from Pintner (1914) and Kelley (1914 comp.) *
will illustrate the method and they also are such as do not
reveal without statistical analysis the inaccuracy of the ratio
method:

* A numerical error occurs in this reference, the figures herewith presented being the
correct ones.

TABLE XXX

                   MEAN SCORES GIVEN TO SAMPLES OF HANDWRITING UPON
NO. OF SAMPLE          Ayres Scale          Thorndike Scale
     12                   20.6                    5.9
      6                   24.2                    6.5
      8                   28.4                    7.3
     21                   35.3                    8.4
      4                   36.2                    8.0
     15                   36.3                    8.3
      1                   37.1                    8.1
     22                   40.3                    8.9
      5                   40.3                    9.0
     17                   41.8                    8.9
     18                   48.9                   10.1
     14                   49.2                   10.2
      9                   52.4                   10.7
      7                   55.7                   10.6
     24                   55.7                   10.8
     11                   56.0                   10.7
     10                   56.9                   11.3
      2                   57.7                   10.9
     13                   58.0                   11.2
     19                   58.9                   11.5
     20                   64.2                   11.8
     23                   74.2                   13.8
      3                   80.1                   14.2
     16                   82.1                   14.8


Calling the Ayres X1 scores and the Thorndike X2 scores and
calculating the required constants yields:

M1 = 49.60     σ1 = 15.93     M2 = 10.08     σ2 = 2.229

X1’s and X2’s satisfying the following equation are comparable
measures:

(X1 − M1) / σ1 = (X2 − M2) / σ2 (Equivalent scores upon the assumption of equality of standard measures) . . . [66]

Solving for certain values yields the equivalent scores given in
the first two columns of the following table, XXXI. Treating
the same data by the index method gives the equation:

X1 / 49.60 = X2 / 10.08

Scores which are equivalent as derived from this equation are
given in the last two columns of the table.


TABLE XXXI

  Standard Measures Method               Ratio Method
     EQUIVALENT SCORES                EQUIVALENT SCORES
   Ayres        Thorndike           Ayres        Thorndike
    X1             X2                X1             X2
  −22.4            0.0               0.0            0.0
    0.0            3.1
   20.5            6.0              29.5            6.0
   49.6           10.1              49.6           10.1
   70.0           12.9              70.0           14.2
   84.8           15.0              73.8           15.0


The two methods lead to different results and a very brief 
study of the original data shows that the equivalents obtained 
by the standard measure method are much the more reasonable. 
The fundamental error in this problem of the ratio method is 
in the assumption of equality of zero scores. That this is an 
error would not be self-evident to the user of the scales, as 
samples of handwriting of less merit than 20 on the Ayres 
scale or 6.0 on the Thorndike are seldom found, so that what 
constitutes a sample of zero merit on either scale is quite 
unknown. A similar observation applies to economic situa- 
tions, for who has experience with, or knows the meaning of, 
0 ¢ as the cost of, let us say, a pound of bread?
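
The constants and the two sets of equivalents may be recomputed from the handwriting scores with the Python sketch below. It is an illustration only: the scores are entered as read from Table XXX, the function names are hypothetical, and the standard deviation is taken with N (not N − 1) in the denominator, which is what reproduces the constants quoted above.

```python
from statistics import fmean, pstdev

# Mean scores of the 24 handwriting samples, in the order of Table XXX.
ayres = [20.6, 24.2, 28.4, 35.3, 36.2, 36.3, 37.1, 40.3, 40.3, 41.8, 48.9, 49.2,
         52.4, 55.7, 55.7, 56.0, 56.9, 57.7, 58.0, 58.9, 64.2, 74.2, 80.1, 82.1]
thorndike = [5.9, 6.5, 7.3, 8.4, 8.0, 8.3, 8.1, 8.9, 9.0, 8.9, 10.1, 10.2,
             10.7, 10.6, 10.8, 10.7, 11.3, 10.9, 11.2, 11.5, 11.8, 13.8, 14.2, 14.8]

m1, s1 = fmean(ayres), pstdev(ayres)
m2, s2 = fmean(thorndike), pstdev(thorndike)

def thorndike_by_standard_measure(x1):
    # Equal standard measures: (X1 - M1)/s1 = (X2 - M2)/s2.
    return m2 + s2 * (x1 - m1) / s1

def thorndike_by_ratio(x1):
    # Equal ratios to the means: X1/M1 = X2/M2.
    return x1 * m2 / m1

print(round(m1, 2), round(s1, 2), round(m2, 2), round(s2, 3))
for x1 in (0.0, 49.6, 70.0):
    print(x1, round(thorndike_by_standard_measure(x1), 1),
          round(thorndike_by_ratio(x1), 1))
```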

Reference to the equations giving equivalent scores shows 
that knowledge of the means, in case the means are the bases, 
is all that is necessary to determine the equation giving equiva- 
lent scores in the case of the ratio method; but that an added 
item of information, the standard deviations, is required in 
the case of the standard measure method. If equivalent 
measures really are proportionate as assumed by the index 
method, the equating of standard measures results in the same 
set of equivalents as given by the ratio method. This special 
case exists when 


M1 / σ1 = M2 / σ2, for then (X1 − M1) / σ1 = (X2 − M2) / σ2 reduces to X1 / M1 = X2 / M2.


Accordingly the standard measure method is the more general 
and contains the ratio method as one of its special cases. 




Section 35. THE EQUIVALENCE OF SUCCESSIVE PERCENTILES 
METHOD 

This method involves no assumption that the law covering
the relation between successive scores is of any particular type
other than that involved in the statement “the larger the
score the greater the trait, or characteristic, being measured.”
Otis (1916) and (1918), in dealing with paired measures, has
used a graphic method which gives a line of “rank relation.”
His method, equivalent to setting the lowest score in series
one equal to the lowest score in series two, the next lowest in
series one equal to the next lowest in series two, etc., could
be called “the equivalence of successive ranks” method, but
the title here given is used as being the more general. The
method does not depend upon paired measures or upon having 
two series of the same population, though if measures are 
paired and high correlation exists between them the reliability 
of equatings is greatly increased. 

Letting P stand for percentiles in the first series and P′ for
those in the second, the method assumes that equivalent
scores are P.01 and P′.01; P.02 and P′.02; etc.; and in general

Pp is equivalent to P′p (Comparable percentiles) . . . [67]

No single one of these equivalents, P.01 = P′.01, etc., can be
determined with the reliability that appertains to M = M′,
or σ = σ′, but, unless it has been experimentally determined
that relationships between the two series are rectilinear, or 
curvilinear according to a known law, a more accurate total 
set of equivalents may be expected from this method than from 
either of the two preceding. Objections to the method are, 
first, that no concise algebraic statement of relationship comes 
from it and second, that it is responsive to chance oddities in 
distributions. This second objection can be largely overcome 
by smoothing graphically as does Otis or by a moving average, 
as will be illustrated, using the data upon handwriting. 

There are but 24 samples of handwriting so that a percentile 
below the 4.1667th cannot be calculated except by an arbitrary 
assumption as to what constitutes the lower limit of the interval 
corresponding to the lowest score. We will therefore begin 
with the 5th percentile and, to shorten the work, proceed by
fives to the 95th.


TABLE XXXII

                 EQUIVALENT HANDWRITING SCORES      SMOOTHED EQUIVALENT SCORES
PERCENTILES      Ayres Scale    Thorndike Scale       Ayres        Thorndike
     5             23.18             6.34              23.1           6.35
    10             28.52             7.20              26.7           7.30
    15             34.19             7.89              32.4           7.90
    20             36.15             8.17              34.2           8.20
    25             36.70             8.35              36.0           8.50
    30             38.935            8.68              37.8           8.80
    35             40.145            8.86              39.6           9.10
    40             43.65             9.31              42.3           9.40
    45             48.31            10.03              46.8           9.80
    50             50.80            10.40              49.5          10.25
    55             54.23            10.66              54.15         10.45
    60             55.31            10.72              55.05         10.65
    65             56.21            10.81              55.95         10.85
    70             57.13            11.01              56.85         11.05
    75             57.85            11.25              57.75         11.25
    80             59.07            11.45              59.1          11.55
    85             64.61            12.11              64.5          12.25
    90             73.97            13.52              74.4          13.45
    95             80.31            14.40              79.8          14.35


TABLE XXXIII

Differences between Successive Five-Percentiles

     RAW PERCENTILES              SMOOTHED PERCENTILES
   Ayres      Thorndike           Ayres      Thorndike
   5.34          .86               3.6          .95
   5.67          .69               5.7          .6
   1.96          .28               1.8          .3
    .55          .18               1.8          .3
   2.235         .33               1.8          .3
   1.21          .18               1.8          .3
   3.505         .45               2.7          .3
   4.66          .72               4.5          .4
   2.49          .37               2.7          .45
   3.43          .26               4.65         .2
   1.08          .06                .9          .2
    .90          .09                .9          .2
    .92          .20                .9          .2
    .72          .24                .9          .2
   1.22          .20               1.35         .3
   5.54          .66               5.4          .7
   9.36         1.41               9.9         1.2
   6.34          .88               5.4          .9
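
The raw percentile equivalents can be recomputed from the scores of Table XXX. The Python sketch below is an illustration resting on an assumed rule inferred from the tabled values: each distinct score is taken to fill the interval between the midpoints to its neighbouring scores, tied scores sharing one interval. The function name is hypothetical, and percentiles falling within the lowest or highest score are refused, since they require the arbitrary assumption mentioned in the text.

```python
from itertools import groupby

ayres = [20.6, 24.2, 28.4, 35.3, 36.2, 36.3, 37.1, 40.3, 40.3, 41.8, 48.9, 49.2,
         52.4, 55.7, 55.7, 56.0, 56.9, 57.7, 58.0, 58.9, 64.2, 74.2, 80.1, 82.1]
thorndike = [5.9, 6.5, 7.3, 8.4, 8.0, 8.3, 8.1, 8.9, 9.0, 8.9, 10.1, 10.2,
             10.7, 10.6, 10.8, 10.7, 11.3, 10.9, 11.2, 11.5, 11.8, 13.8, 14.2, 14.8]

def percentile(scores, p):
    """Score at the p-th percentile under the midpoint-interval rule."""
    values = sorted(scores)
    groups = [(v, len(list(g))) for v, g in groupby(values)]  # (score, frequency)
    target = p / 100 * len(values)        # number of cases lying below the point
    below = 0
    for i, (v, count) in enumerate(groups):
        if target <= below + count:
            if i == 0 or i == len(groups) - 1:
                raise ValueError("percentile falls within an end score")
            lower = (groups[i - 1][0] + v) / 2
            upper = (v + groups[i + 1][0]) / 2
            return lower + (target - below) / count * (upper - lower)
        below += count
    raise ValueError("p must lie between 0 and 100")

for p in range(5, 100, 5):
    print(p, round(percentile(ayres, p), 3), round(percentile(thorndike, p), 2))
```

Under this reading the sketch agrees with the raw columns of Table XXXII except for small differences at the 35th and 40th Ayres percentiles, where it gives 40.345 and 43.63.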




The smoothed percentile scores have been calculated from
the original series after grouping the Ayres data in 3’s (score
21, frequency 1; sc 24, f 1; sc 27, f 1; sc 30, f 0; sc 33, f 0;
sc 36, f 4, etc.) and the Thorndike scores in .5’s (sc 6.0, f 1;
sc 6.5, f 1, etc.). A moving average would probably lead to
slightly better results, but would be laborious with the uneven
spacing here present in the scores.

We may judge of the excellence of the two sets of equivalent 
scores, since the drawing up of a correlation table for the data 
of Table XXX shows that the relationship between the two 
scales is almost exactly rectilinear, so that differences between 
the percentiles upon the one scale should be proportionate to 
the differences upon the other scale. Columns 1 and 2 of Table 
XXXIII give these differences for the raw data and columns 
3 and 4 give the differences determined from the smoothed 
data. Rather better results are obtained from the raw data 
than from the grouped, as would be expected from data show- 
ing the high degree of correlation here present. The small 
fluctuations are, in material part, not random, but genuine, 
and the grouping process has therefore distorted the facts. 

This method of equating scores is thoroughly empirical and 
therefore applicable to situations in which the law of relation- 
ship between variables is unknown, or at least cannot be stated 
in a simple algebraic formula, but in which sufficient reason 
exists to warrant the equating. 

If several series are to be equated a very serviceable modifi- 
cation of the preceding method is to equate each series, not to 
any one of them, but to a normal distribution. This can be 
done, using formula [55], giving by the aid of Table K-W the
mean deviation of a portion of a normal distribution. An 
illustration will make clear the steps involved: 

It is frequently desired to compare the performances of pupils 
receiving marks in different subjects. If the pupils have no 
subjects and no teachers in common, this can only be done by 
making some assumption. If there are three teachers, each 
with 50 pupils, it is more reasonable to assume that the mean 
abilities of the three groups are equal than that similar literal 
or percentage grades of the three teachers are equivalent. The 
data of Table XXXIV present the problem. 


TABLE XXXIV

MARKS USED    PERCENTAGE     MARKS USED    PERCENTAGE     MARKS USED    PERCENTAGE
BY FIRST      GIVEN MARK     BY SECOND     GIVEN MARK     BY THIRD      GIVEN MARK
TEACHER       INDICATED      TEACHER       INDICATED      TEACHER       INDICATED

    A            2.0             A+            .7             1             4.3
    B           17.2             A           13.9             2            38.3
    C           31.2             A−           4.5             3            50.3
    D           40.0             B+           4.6             4             7.1
    E            8.1             B           29.4
    F            1.5             B−           4.3
                                 C+           4.7
                                 C           22.7
                                 D            9.2
                                 E            6.0


It is obvious that a mark of A given by the first teacher indi- 
cates greater merit than a mark of A given by the second teacher. 
Equating each mark to a standard-measure score in a normal 
distribution gives: 

TABLE XXXV


MARKS USED   EQUIVALENT   |  MARKS USED   EQUIVALENT   |  MARKS USED   EQUIVALENT
BY FIRST     STANDARD     |  BY SECOND    STANDARD     |  BY THIRD     STANDARD
TEACHER      MEASURE      |  TEACHER      MEASURE      |  TEACHER      MEASURE

A               2.4       |  A+               2.8      |  1               2.1
B           [illegible]   |  A                1.5      |  2                .8
C                .4       |  A-               1.0      |  3               -.5
D               -.6       |  B+                .8      |  4              -1.9
E              -1.6       |  B            [illegible]  |
F              -2.5       |  B-           [illegible]  |
                          |  C+               -.2      |
                          |  C                -.6      |
                          |  D               -1.3      |
                          |  E               -2.0      |


The method requires little time, but were such equatings being
done for a large number of classes a still briefer method could
be followed. Instead of finding the mean standard deviation
score for the upper 2 per cent, we may find the median: 1/2
the percentage of A’s = 1.0, therefore from Table K-W 2.3 is
the standard deviation score which is equivalent to the mark of
A given by the first teacher. The percentage of A’s plus 1/2 the
percentage of B’s = 10.6, therefore 1.2 is the score equivalent
to B. Similarly, .4 is equivalent to C; −.5 to D; −1.6 to E;
and −2.4 to F.
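The two ways of equating a mark to a standard measure can be sketched in code. This is only an illustration: it uses Python's `statistics.NormalDist` in place of Table K-W, the function names are hypothetical, and the 17.2 per cent of B's is inferred from the statement that the A's plus half the B's make 10.6 per cent.

```python
from statistics import NormalDist

_STD = NormalDist()  # unit normal: mean 0, standard deviation 1


def _ordinate_at_upper_tail(p):
    """Normal ordinate at the point that cuts off the upper proportion p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0  # the ordinate vanishes at either infinite extreme
    return _STD.pdf(_STD.inv_cdf(1.0 - p))


def mean_score(above, share):
    """Mean standard score of the slice of a normal distribution that lies
    below the top `above` proportion and contains the proportion `share`."""
    upper = _ordinate_at_upper_tail(above)
    lower = _ordinate_at_upper_tail(above + share)
    return (lower - upper) / share


def median_score(above, share):
    """Standard score of the middle case of the same slice."""
    return _STD.inv_cdf(1.0 - (above + share / 2.0))


# First teacher: 2.0 per cent A's, then 17.2 per cent B's (inferred value).
print(round(mean_score(0.0, 0.02), 1))      # mean method for A
print(round(median_score(0.0, 0.02), 1))    # median method for A
print(round(median_score(0.02, 0.172), 1))  # median method for B
```

The mean method uses the difference of the bounding ordinates divided by the proportion in the slice; the median method needs only one inverse lookup per mark, which is why it is the briefer of the two.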




The marks given by the second teacher are typically those 
of a careful grader and show more discrimination than do those 
of either the first or third teacher, but nevertheless it is more 
reasonable to assume a normal distribution of talent than 
such a tri-modal distribution as is indicated by the second 
teacher’s marks. The method may frequently be used for the 
single purpose of warping data showing an extreme distribution 
into a more reasonable mold. 

The observation has been made that in order to be com- 
parable the two series should be independent measures of the 
same thing. It is shown in Section 56 how certain correlation 
functions enable one to estimate whether two series of scores 
are measures of the same thing. In general it is not necessary 
that a raw correlation between the two series approaching 1.00 
be found, but merely that a coefficient of correlation corrected 
for attenuation of 1.00 be present. 


CHAPTER VII 
THE FITTING OF CURVES TO DISTRIBUTIONS 


Section 36. METHODS OF FITTING CURVES TO OBSERVATIONS


The properties of the normal distribution as given in Chap- 
ter V are such that if data fall approximately into this form their 
interpretation and treatment are frequently greatly simplified. 
As a practical matter it is often serviceable to treat data as 
normal even though slight divergence from normality may be 
known to exist. Probably, however, the majority of distribu- 
tions cannot by any stretch of interpretation be considered 
normal. In such case one may resort to one of two procedures, 
(a) either warp data into a normal mold by transformation 
devices, or (b) discard the concept of normality altogether and 
endeavor to discover an equation which does describe the 
data. 

The equation of the normal curve is

\[ y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-x^{2}/(2\sigma^{2})} \]


Not counting N, the population, which does not affect the
type of curve, there is only one degree of freedom in this curve
since σ is the only constant which is to be determined from
the data. To permit of greater freedom one could start as
did Edgeworth with an equation of the type

\[ y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-f^{2}/(2\sigma^{2})} \]


in which f is some function of x. As f(x) is made more and
more general, greater and greater freedom is given. Other
variations of this approach have been followed by Edgeworth
(1904), Kapteyn (1903), Thiele (1903) and Charlier (1906).

Pearson has criticized this method because the function built 
up is what he terms a “shadow function,” something not
corresponding to any physical measurement, not representing 
any relationship which is in itself capable of independent 
interpretation; and as a procedure which tends to make a 
fetish of the normal distribution. However, should this ghost 
take on flesh and bone and be found, in certain important cases, 
to be a measure of what would seem to be a causal force, the 
method would be amply justified. Judgment may well be held 
in abeyance pending further experimental treatment. Later 
in this chapter the normal distribution will be shown to hold a 
unique and peculiarly dominant position among all the Pearson 
curves, but this is not an argument for arbitrarily forcing data 
into this form. It is rather an argument for the study of the 
features of a given distribution which diverges from this form. 
The first four sections of this chapter are concerned with the 
practical details of curve fitting while the theme of the last two 
sections is the bearing of types of distributions upon problems 
of stability and trends in evolution. 


Section 37. THE PRINCIPLE UNDERLYING PEARSON’S METHOD
OF CURVE FITTING


Pearson imposes certain very broad conditions upon the
differential equation of the curve. These conditions are so
general that many varieties of non-bi-modal distributions are
represented. These include (a) curves with a maximum
frequency somewhere between the limits of the range, called
“i-shaped” curves, (b) such as have an anti-mode, or point
of minimum frequency between the limits of the range, called
“u-shaped” curves, and (c) such as have no mode, called
“j-shaped” curves. The present treatment will describe the
calculation of a few of the more important of the fifteen Pearson
types, and will present such criteria as are necessary in
determining the type of curve to which given data belong, so that
one may then go to Pearson’s Tables (1914 tables) and other
sources, Elderton (1906), Pearson (1894), (1895 and sup 1901),
(1902 sys), (1906 skew), (1915 cert) and (1916 app), and determine
the equation of the curves.

The fundamental proposition in Pearson’s method is, first, that
in order to have a good fit the first four moments of the data
should equal the first four moments of the derived equation
and, second, that formula [81] expresses the general differential
equation covering all uni-modal curves. The moments are
fundamental and may be obtained by aid of the accompanying
formulas.

Let the required moments be μ₁, μ₂, μ₃, μ₄.

Let the four moments from the mean, but uncorrected for
grouping, be ν₁, ν₂, ν₃, ν₄.

Let the raw moments from the arbitrary origin be ν′₁, ν′₂, ν′₃, ν′₄.
Then the following equations lead to the calculation of the μ’s:

\[ \nu'_1 = \frac{\Sigma x}{N}, \quad \nu'_2 = \frac{\Sigma x^{2}}{N}, \quad \nu'_3 = \frac{\Sigma x^{3}}{N}, \quad \nu'_4 = \frac{\Sigma x^{4}}{N} \]

(Moments from the mean, knowing them from an arbitrary origin)

\[ \nu_1 = \nu'_1 - \nu'_1 = 0 \]
\[ \nu_2 = \nu'_2 - \nu'^{2}_1 \]   [24]
\[ \nu_3 = \nu'_3 - 3\,\nu'_2\,\nu'_1 + 2\,\nu'^{3}_1 \]   [21]
\[ \nu_4 = \nu'_4 - 4\,\nu'_3\,\nu'_1 + 6\,\nu'_2\,\nu'^{2}_1 - 3\,\nu'^{4}_1 \]

Continuing (Sheppard’s corrections applied to moments from the mean)

\[ \mu_1 = \nu_1 = 0 \]   [68]
\[ \mu_2 = \nu_2 - \tfrac{1}{12} \]   [68 a, see also 47]
\[ \mu_3 = \nu_3 \]   [68 b]
\[ \mu_4 = \nu_4 - \tfrac{1}{2}\,\nu_2 + \tfrac{7}{240} \]   [68 c]


Sheppard’s corrections are for an error in the moments due to 
grouping. They are to be used in case of “high contact”; 
that is, when the curve approaches asymptotically the base 
line, or x-axis, at both extremities. In case high contact at 
both extremities is not present, corrections as given by Pair- 
man and Pearson (1919) should be used. 
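The chain from raw moments to corrected moments can be sketched as follows. The function name and its arguments are hypothetical, and the sketch assumes that x is measured in class-interval units, which is the unit in which the constants 1/12, 1/2 and 7/240 of Sheppard's corrections apply.

```python
def grouped_moments(freqs, origin_index=0):
    """Return (raw, central, corrected) moments of a grouped distribution.

    freqs[i] is the frequency of the i-th class; x is counted in
    class-interval units from the class at `origin_index`.
    """
    n = sum(freqs)
    xs = [i - origin_index for i in range(len(freqs))]
    # Raw moments about the arbitrary origin.
    r1, r2, r3, r4 = (
        sum(f * x ** k for f, x in zip(freqs, xs)) / n for k in (1, 2, 3, 4)
    )
    # Moments about the mean, uncorrected for grouping.
    v2 = r2 - r1 ** 2
    v3 = r3 - 3 * r2 * r1 + 2 * r1 ** 3
    v4 = r4 - 4 * r3 * r1 + 6 * r2 * r1 ** 2 - 3 * r1 ** 4
    # Sheppard's corrections (high contact at both ends assumed).
    m2 = v2 - 1 / 12
    m3 = v3
    m4 = v4 - v2 / 2 + 7 / 240
    return (r1, r2, r3, r4), (0.0, v2, v3, v4), (0.0, m2, m3, m4)


raw, central, corrected = grouped_moments([1, 4, 6, 4, 1])
print(raw, central, corrected)
```

Shifting `origin_index` changes the raw moments but leaves the moments about the mean unchanged, which is a convenient check on hand computation.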

It should be noticed that the ν′’s are here defined as were
the μ′’s in Section 21, that the ν’s here are the same as the μ’s in
that section, and that the μ’s here differ slightly from the ν’s
(or the μ’s of Section 21), being corrected for a grouping error.

Certain derived constants, β₁, β₂ and the criterion κ₂ are also
needed in determining the type to which given data belong.
In earlier work in curve fitting a criterion κ₁ was used and
though it is not as general a criterion as κ₂ it has much theoretical
interest.




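\[ \beta_1 = \frac{\mu_3^{2}}{\mu_2^{3}} \]   (One measure of skewness) . . . [69]

\[ \beta_2 = \frac{\mu_4}{\mu_2^{2}} \]   (One measure of kurtosis) . . . [70]

\[ \kappa_1 = 2\beta_2 - 3\beta_1 - 6 \]   (Criterion κ₁) . . . [71]

\[ \kappa_2 = \frac{\beta_1(\beta_2 + 3)^{2}}{4\,(4\beta_2 - 3\beta_1)(2\beta_2 - 3\beta_1 - 6)} \]   (Criterion κ₂) . . . [72]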
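As an illustration of formulas [69] to [72], the following sketch computes the four constants from the corrected moments. The function name is hypothetical, and the handling of a zero denominator in κ₂ (which occurs on the Type III line, where κ₁ = 0) is a choice made for this sketch rather than anything stated in the text.

```python
import math


def pearson_criteria(mu2, mu3, mu4):
    """Return (beta1, beta2, kappa1, kappa2) from moments about the mean."""
    beta1 = mu3 ** 2 / mu2 ** 3
    beta2 = mu4 / mu2 ** 2
    kappa1 = 2 * beta2 - 3 * beta1 - 6
    numer = beta1 * (beta2 + 3) ** 2
    denom = 4 * (4 * beta2 - 3 * beta1) * kappa1
    if denom == 0:
        # kappa2 is indeterminate at the normal point, infinite elsewhere
        # on the line kappa1 = 0. Exact float equality is used on purpose;
        # real data will rarely fall exactly on the line.
        kappa2 = math.nan if numer == 0 else math.inf
    else:
        kappa2 = numer / denom
    return beta1, beta2, kappa1, kappa2


# Rectangular distribution on (-1, +1): mu2 = 1/3, mu3 = 0, mu4 = 1/5.
print(pearson_criteria(1 / 3, 0.0, 1 / 5))
```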

The connection between the β’s and the type of curve may
be shown by the illustrative curves of Chart XIX and by
the following Chart XVIII, which has, in addition to the
lines of Diagram XXXV in Pearson’s Tables, certain lines and
points for more recently discovered types of curves, as well
as lines giving the finite limits of various moments. The
meaning of the (μₙ = ∞) lines in Chart XVIII will be clear by
an illustration. It is found by reference to the Chart that
the lines (μ₋₇ = ∞) and (μ₂₁ = ∞) approximately pass through
the point (β₁ = 1.45, β₂ = 5.66). The equation of the curve
fitting a distribution yielding these β’s has all of its moments
between μ₋₇ and μ₂₁ finite, and moments outside these limits
are infinite. For the positive moments the mean, a finite
boundary, or any other finite point, may be taken as the origin,
while for the negative moments one of the boundaries of the
distribution is the origin. For a point above Type III no
positive moments are infinite and for a point below Type V
no negative moments (defined further in Section 40) are
infinite. Only certain of the breakdown lines, i.e., lines where
the moment becomes infinite, have been drawn, there being an
infinity of positive moment breakdown lines between (μ₂₁ = ∞)
and Type III and an infinity of negative moment breakdown
lines between (μ₋₇ = ∞) and Type V. The discussion of the
significance of these lines will follow shortly.

After determining β₁ and β₂ from the data, a corresponding
point on Chart XVIII may be located. Should this be a point
on a line the equation of the distribution will have two degrees
of freedom in addition to that based upon N, the population.
If the (β₁, β₂) point lies in a space between lines, the equation
of the curve has one more constant in it and one greater degree
of freedom. If the (β₁, β₂) point falls on certain designated
spots on the lines, especially if it falls where two curves cross,
the equation of the curve simplifies and has but one constant.
In general the (β₁, β₂) point will not lie exactly on a line or on a
unique point in a line, but if near such a place much labor in
fitting a curve may be saved by choosing the simpler equation.
This is frequently permissible, as may be decided from Charts
and Tables given in Pearson’s Tables, from which the probable
error of the location of the (β₁, β₂) point may be determined.
It is therefore possible to tell how unreasonable it would be
to choose a type represented by the simpler form.

[Chart XVIII. Diagram of Types of Frequency Distribution.]


Section 38. DESCRIPTION OF TYPES OF CURVES 


We will first note points upon the lines which give the very 
simple one-constant equations. Reference to the drawings of 
Chart XIX will show the general form of the curves. 

(M) The point of meeting of the line β₁ = 0, along which
all distributions are symmetrical, and the line β₂ − β₁ − 1 = 0,
along which all distributions consist of frequencies in two
categories.

β₁ = 0, β₂ = 1

At point (M) two equal categories constitute the distribution.
Pearson has not given a name to this point nor assigned a
type number to the line β₂ − β₁ − 1 = 0. Due to the
importance of the 1:1 ratio from the Mendelian point of view
I have called this point (M). The line might be called the
Mendelian line, but as it includes all two-category distributions
and not simply those having Mendelian significance, I will
call it the Two-Category Type Line.

(R) The point corresponding to a rectangular distribution.

β₁ = 0, β₂ = 1.8

This point is the juncture of many lines and may therefore be
considered a special case of any of the types which meet here,
i.e., Types II-u, II-i, I-j, I-i, VIII, IX-1, XII. This point
shares with the exponential the distinction of being the conflux
of the greatest number of types of any point in the diagram,
not excepting the normal point. There is a point, not in the
field corresponding to real distributions (β₁ = −4, β₂ = −3),
which is still more exceptional as judged by the number of
lines which pass through it.


[Chart XIX. Illustrative curves for the special points and types.
Panels: M, special case of Type I-u; R, rectangle (β₁ = 0,
β₂ = 1.8; range from −a to +a); N, normal or Gaussian;
P, parabolic (range from −a to +a); L, line, point of division
between Types IX-1 and IX-2; E, exponential, Type X (β₁ = 4,
β₂ = 9; range from 0 to ∞); A, the curve corresponding to the
point β₁ = 0, β₂ = 9, slightly less peaked than the point (A)
curve; Type VII (range from −∞ to ∞).]


(N) The point corresponding to the normal distribution.

β₁ = 0, β₂ = 3.0

This is the conflux of Types I-i, II-i, III-i, IV-i, V, VI and VII.
All of these are i-curves, that is, they are characterized by a
single positive mode and have zero frequencies and a slope of
zero at the upper and lower limits of the distribution. Further
unique characteristics of this point will be pointed out in
connection with reliability.

(P) The point corresponding to a parabola.

β₁ = 0, β₂ = 2 1/7

This is simply a special point in the Type II-i line.

(A) The point corresponding to the symmetrical Type VII
distribution for which the mean and the median are equally
reliable averages. The point is not here located exactly, but
it is in the neighborhood of

β₁ = 0, [limits for β₂ illegible]

Below this point the median is more reliable than the mean
and above this point less reliable. It should be noted that the
line

8β₂ − 15β₁ − 36 = 0

is far above this point. The probable error of the fourth 
moment becomes infinity below this line. Accordingly the 
equation of a curve, or any other function involving the 
fourth moment, loses significance. The mean and the prob- 
able error of the mean do not involve a higher moment than 
the second, so that they remain significant for distributions 
for which it is impossible to fit a curve. In other words, the 
fourth moment breaks down as a significant feature of a 
distribution long before the second moment or the standard 
deviation; and these latter in turn break down before the first 
moment, or mean; and for certain distributions (e.g., β₁ = 0,
β₂ > 12.0) the mean breaks down not only when the median
does not, but when it is in fact rapidly improving as a measure 
of central tendency. Were we to go in the other direction 
into the Type II-u region we would find the median breaking 
down while the mean remains very reliable. This point is 
taken up later. 


[Chart XIX, continued. Further illustrative curves, including
the Two-Category Type.]


(L) The point corresponding to the line distribution

β₁ = .32, β₂ = 2.4

This is a point of change of types. On the line to the left of
this point distributions are Type IX-1 and to the right Type
IX-2.

(E) The point corresponding to the exponential distribution

β₁ = 4.0, β₂ = 9.0

This point, which is well off the chart as drawn, is at the
intersection of Type IX-2 and Type III lines. Type IX-2 curves
become Type X curves at this point and Type XI curves
beyond it. Type III-i curves become Type X curves at this
point and Type III-j beyond it. The exponential is therefore
located at the juncture of Types I-i, I-j, III-i, III-j, VI-i,
VI-j, IX-2, XI.

There are at least five salient one-constant distributions,
three of them, (M), (R) and (N), representing symmetrical
distributions and two of them, (L) and (E), constituting
division points on the one line that divides i from j curves.

Excepting the special points noted, points upon any of the 
lines in the diagram correspond to two-constant distributions. 

Types II-u, II-i, VII. The line

β₁ = 0

represents three types, II-u, II-i, VII, in addition to the
special points (M), (R), (P) and (N). Following Pearson, this
line would be a boundary of “possible” distributions.

Two-Category Type. Another boundary would be the line

β₂ − β₁ − 1 = 0

Looking upon distributions along this line as limiting cases of
Type I-u distributions, it is seen that the equation representing
them involves exponents which are infinite. For this reason
no equation for this type is given.
Types VIII, IX-1, IX-2, XI. The line

β₁(8β₂ − 9β₁ − 12)(β₂ + 3)² = (10β₂ − 12β₁ − 18)²(4β₂ − 3β₁)

represents Types VIII, IX-1, IX-2, XI in addition to the
special points (R), (L) and Type X, or (E). This bi-quadratic,
which we will call f, divides, on the one hand, the u-shaped
curves from the j-shaped, and on the other hand, the j-shaped
from the i-shaped. All j-shaped curves lie within the arms of
this bi-quadratic.

[Chart XIX, continued. Illustrative curves for the three-constant
types, among them Type I-u, Type I-i and Type VI-j, each shown
with its bounding types.]

Type XII. The line

5β₂ − 6β₁ − 9 = 0

represents Type XII curves, which are j-shaped throughout
the entire length of the line. In addition the special point
(R) is on this line.

Types III-i, III-j. The line

2β₂ − 3β₁ − 6 = 0

represents Type III-i between points (N) and (E) and Type
III-j beyond point (E). Containing as it does the two
important points (N) and (E) and all points on the straight line
connecting them, it is a very important type and, considering
that it has but two parameters in addition to N, the
population, it fits in a quite remarkable manner a large number of
skew curves. Further characteristics of this type are pointed
out later.

Type V. The line

4(4β₂ − 3β₁)(2β₂ − 3β₁ − 6) = β₁(β₂ + 3)²

(Identical with κ₂ = 1)

represents Type V, composed entirely of i-shaped curves, in
addition to the special point (N).
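The statements about which lines meet at which special points can be checked by direct substitution. The sketch below is an illustration only: the dictionary keys and function names are hypothetical, exact fractions are used so that "on the line" means exactly zero, and the coordinates taken for point (L), β₁ = 8/25 and β₂ = 12/5, are the values computed for a right-triangular ("line") distribution.

```python
from fractions import Fraction as F


def type_lines(b1, b2):
    """Left side minus right side of each line equation; zero on the line."""
    return {
        "two_category": b2 - b1 - 1,
        "III": 2 * b2 - 3 * b1 - 6,
        "XII": 5 * b2 - 6 * b1 - 9,
        "V": 4 * (4 * b2 - 3 * b1) * (2 * b2 - 3 * b1 - 6)
             - b1 * (b2 + 3) ** 2,
        "f": b1 * (8 * b2 - 9 * b1 - 12) * (b2 + 3) ** 2
             - (10 * b2 - 12 * b1 - 18) ** 2 * (4 * b2 - 3 * b1),
    }


POINTS = {
    "M": (F(0), F(1)),
    "R": (F(0), F(9, 5)),
    "N": (F(0), F(3)),
    "L": (F(8, 25), F(12, 5)),
    "E": (F(4), F(9)),
}


def lines_through(name):
    """Names of the lines (other than beta1 = 0) passing through a point."""
    b1, b2 = POINTS[name]
    return sorted(k for k, v in type_lines(b1, b2).items() if v == 0)


for name in POINTS:
    print(name, lines_through(name))
```

The line β₁ = 0 is left out of the dictionary because membership in it is evident from the coordinates themselves.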

This completes the points and the lines. Points anywhere 
in the regions between lines correspond to three-constant 
distributions. 

Type I-u. Composed entirely of u-shaped curves varying 
all the way from the Two-Category type to Type VIII. 

Type I-j. Composed entirely of j-shaped curves. This 
region might appropriately be divided into two types, I-j-1 
and I-j-2, depending upon which side of the Type XII line the 
point is located. 

Type I-i. Composed entirely of i-shaped curves varying
from Type IX to Type III. This is the only type area which
is finite, as Type II, Type IX and Type III lines completely
bound this region.




Types VI-i and VI-j. Type VI-i, composed entirely of 
i-shaped curves, lies below the Type III line and also below 
Type XI line. Type VI-j composed entirely of j-shaped 
curves, lies below Type III line and above Type XI line. 

Type IV, composed entirely of highly leptokurtic i-shaped
curves. This region lies below the Type V line. Below the line

8β₂ − 15β₁ − 36 = 0

is a region in which the probable error of the fourth moment is
infinite, but it is not uncommon to find data which yield a
(β₁, β₂) point below this line. In such case one of the
outstanding features of the distribution is this very fact of an
infinite eighth moment in the fitted curve, which is the cause
of the infinite probable error of the fourth moment. Other
significant features of the distribution may be determined
from lower moments than the fourth, which continue to have
finite probable errors for some distance below the critical line
given. Pearson has named the region below this critical line
the heterotypic region. As I understand the heterotypic to
include bi-modal distributions I consider the designation inapt,
as I can discover no evidence suggesting bi-modal tendencies
in Type IV distributions. At present it is a sort of no-man’s
land. It is conceivable that there may be lines in it,
corresponding to two-constant distributions not involving the fourth
moment, and therefore determinable. There may also be
unique points not involving either the third or the fourth
moment. For one, the point (β₁ = 0, β₂ = 9) may be
considered such. The equation of this curve is

\[ y = y_0 \left(1 + \frac{x^{2}}{a^{2}}\right)^{-3} \]

It is the Type VII curve having the smallest possible integral
exponent, and is completely determined by moments below
the third and fourth. Furthermore, the probable error of the
second moment, or standard deviation squared, is finite
although the point (β₂ = 9) is exactly twice as far down the
Type VII line as the intercept (β₂ = 4.5) of Pearson’s critical
line with the Type VII line. That this curve is not exceptional
is obvious from the drawing of it given in Chart XIX, A.




Section 39. THE FITTING OF THE MOST IMPORTANT TYPES OF
CURVES


The normal distribution. The equation of this curve from
the mean as origin is

\[ y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-x^{2}/(2\sigma^{2})} \]

The constants involved have been defined. The population,
N, and the standard deviation of the distribution, σ, are all
that are needed to determine the normal curve which best
represents given data.

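Type II. The equation from the mean as origin is

\[ y = y_0 \left(1 - \frac{x^{2}}{a^{2}}\right)^{m} \]   . . . [73]

in which

\[ m = \frac{5\beta_2 - 9}{6 - 2\beta_2} \]

\[ a = \tfrac{1}{2}\ \text{the range} = \sqrt{\frac{2\,\mu_2\,\beta_2}{3 - \beta_2}} \]

\[ y_0 = \text{ordinate at the mean} = \frac{N\,\Gamma(2m + 2)}{a\,2^{2m+1}\,\{\Gamma(m + 1)\}^{2}} \]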


The Γ function may be evaluated without resorting to tables.
First, if x is greater than 1, the following equation holds,

\[ \Gamma(x + 1) = x\,\Gamma(x) \]   (Γ function reduction formula) . . . [74]

Second, if x is an integer greater than 1,

\[ \Gamma(x + 1) = x! \]   (Γ function of an integer) . . . [75]

Third (Forsyth, quoted by Pearson 1901 supplement to 1895),
as a close approximation to the value of the function, may be
given,

\[ \Gamma(x + 1) \approx \sqrt{2\pi}\left(\frac{\sqrt{x^{2} + x + \tfrac{1}{6}}}{e}\right)^{x + \frac{1}{2}} \]   (Forsyth evaluation of the Γ function) . . . [76]

To quote from the reference cited, “If x be large the error is
less than 1/(240 x³) of the whole.” Even for an x = 1.5 the
error is only in the neighborhood of 1 per cent. We may,
however, first use the Γ reduction formula and then Forsyth’s,
for small values of x, resulting in as high a degree of accuracy
as may be desired. For example,

\[ \Gamma(1.5) = \frac{\Gamma(2.5)}{1.5} = \frac{\Gamma(5.5)}{1.5 \times 2.5 \times 3.5 \times 4.5} \]

The evaluation of Γ(5.5) by means of Forsyth’s formula is
highly reliable so that Γ(1.5) is readily obtained.
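The gain from combining the reduction formula with Forsyth's approximation can be seen numerically. In this sketch Forsyth's formula is taken in the form Γ(x + 1) ≈ √(2π) {√(x² + x + 1/6) / e}^(x + 1/2); the function names and the `shift` parameter are hypothetical, and `math.gamma` serves only as the reference value.

```python
import math


def forsyth_gamma(x):
    """Forsyth's approximation to Gamma(x + 1)."""
    base = math.sqrt(x * x + x + 1 / 6) / math.e
    return math.sqrt(2 * math.pi) * base ** (x + 0.5)


def gamma_by_reduction(arg, shift=4):
    """Gamma(arg): Forsyth's formula is applied at arg + shift, and the
    reduction formula Gamma(x + 1) = x Gamma(x) is then used `shift` times."""
    value = forsyth_gamma(arg + shift - 1)  # approximates Gamma(arg + shift)
    for k in range(shift):
        value /= arg + k
    return value


exact = math.gamma(1.5)
direct = forsyth_gamma(0.5)        # Forsyth applied directly to Gamma(1.5)
reduced = gamma_by_reduction(1.5)  # Gamma(5.5) / (1.5 * 2.5 * 3.5 * 4.5)
print(abs(direct - exact) / exact, abs(reduced - exact) / exact)
```

With `shift=4` the call reproduces the worked example in the text, dividing Γ(5.5) by 1.5 × 2.5 × 3.5 × 4.5.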

With the determination of y₀ the general solution of the
Type II equation is completed.

Frequently, with immaterial loss in the excellence of fit,
m may be set equal to the integer most nearly equal to
(5β₂ − 9)/(6 − 2β₂) and the resulting equation will be much
simpler to plot. The use of an integral value for the exponent
is equally serviceable in other types of curves. Whatever
value of m is used as the exponent, is of course also to be used
in the equation giving y₀.
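A minimal sketch of the Type II fit, assuming the constants m = (5β₂ − 9)/(6 − 2β₂), a = √(2μ₂β₂/(3 − β₂)) and y₀ = NΓ(2m + 2)/(a 2^(2m+1) {Γ(m + 1)}²). The function names and the `integral_exponent` switch are hypothetical, and the sketch requires β₂ < 3.

```python
import math


def fit_type_ii(n, mu2, beta2, integral_exponent=False):
    """Return (m, a, y0) for y = y0 (1 - x^2/a^2)^m, mean as origin."""
    m = (5 * beta2 - 9) / (6 - 2 * beta2)
    if integral_exponent:
        m = round(m)  # nearest integer; the same m is then used in y0
    a = math.sqrt(2 * mu2 * beta2 / (3 - beta2))
    y0 = n * math.gamma(2 * m + 2) / (
        a * 2 ** (2 * m + 1) * math.gamma(m + 1) ** 2
    )
    return m, a, y0


def type_ii_ordinate(x, m, a, y0):
    """Ordinate of the fitted curve at deviation x from the mean, |x| <= a."""
    return y0 * (1 - x * x / a ** 2) ** m


# Parabolic point: beta2 = 2 1/7 gives m = 1, a^2 = 5 mu2, y0 = 3N / (4a).
print(fit_type_ii(100, 5.0, 15 / 7))
```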

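Type VII. The equation from the mean as origin is,

\[ y = \frac{y_0}{\left(1 + \dfrac{x^{2}}{a^{2}}\right)^{m}} \]   . . . [77]

\[ m = \frac{5\beta_2 - 9}{2\beta_2 - 6} \]

\[ a^{2} = \frac{2\,\mu_2\,\beta_2}{\beta_2 - 3} \]

\[ y_0 = \frac{N\,\Gamma(m)}{\sigma\sqrt{2\pi}\;\Gamma(m - \tfrac{1}{2})\,\sqrt{m - \tfrac{3}{2}}} \]

Note that μ₃ is not involved in the solution of the equations of
Types II and VII. Types III and V do not involve μ₄.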
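The Type VII constants can be computed in the same manner. This sketch assumes the forms m = (5β₂ − 9)/(2β₂ − 6), a² = 2μ₂β₂/(β₂ − 3) and y₀ = NΓ(m)/{σ√(2π) Γ(m − ½) √(m − 3/2)}; the function name is hypothetical and β₂ must exceed 3.

```python
import math


def fit_type_vii(n, mu2, beta2):
    """Return (m, a, y0) for y = y0 / (1 + x^2/a^2)^m, mean as origin."""
    m = (5 * beta2 - 9) / (2 * beta2 - 6)
    a = math.sqrt(2 * mu2 * beta2 / (beta2 - 3))
    sigma = math.sqrt(mu2)
    y0 = n * math.gamma(m) / (
        sigma * math.sqrt(2 * math.pi)
        * math.gamma(m - 0.5) * math.sqrt(m - 1.5)
    )
    return m, a, y0


# The point beta1 = 0, beta2 = 9 with unit standard deviation.
print(fit_type_vii(1.0, 1.0, 9.0))
```

For β₂ = 9 the exponent comes out as the integer 3, so the curve is fixed by N and σ alone.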

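Type III. The equation from the mode as origin is,

\[ y = y_0\, e^{-px/a} \left(1 + \frac{x}{a}\right)^{p} \]   . . . [78]

\[ \frac{p}{a} = \frac{2\,\mu_2}{\mu_3} \]

\[ a = \mu_2 \left(\frac{p}{a}\right) - \left(\frac{a}{p}\right) \]

\[ y_0 = \frac{N}{a} \cdot \frac{p^{\,p+1}}{e^{p}\,\Gamma(p + 1)} \]

\[ \text{Mode} = \text{Mean} - \frac{a}{p} \]

Pearson (quoted in Duffell 1909) has shown that

\[ \log \frac{\Gamma(p + 1)\,e^{p}}{p^{\,p}} = .3990899 + \tfrac{1}{2} \log p + .080929 \sin \frac{25^{\circ}.623}{p} \]   . . . [79]

is a highly accurate equation for values of p > 2. It is
accordingly a simple matter to determine y₀ by the aid of this
equation.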
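A sketch of the Type III fit and of the logarithmic approximation follows. It assumes the approximation has the form log₁₀{Γ(p + 1) eᵖ / pᵖ} = .3990899 + ½ log₁₀ p + .080929 sin(25°.623/p), and it assumes the constants p/a = 2μ₂/μ₃ and a = μ₂(p/a) − (a/p). The function names are hypothetical, and `math.lgamma` supplies the reference value.

```python
import math


def pearson_log_ratio(p):
    """Approximate log10 of Gamma(p + 1) e^p / p^p."""
    angle = math.radians(25.623) / p
    return 0.3990899 + 0.5 * math.log10(p) + 0.080929 * math.sin(angle)


def exact_log_ratio(p):
    """The same quantity from the library log-gamma function."""
    return (math.lgamma(p + 1) + p - p * math.log(p)) / math.log(10)


def fit_type_iii(n, mu2, mu3):
    """Return (p, a, y0, mean_minus_mode) for the curve with mode as origin."""
    ratio = 2 * mu2 / mu3          # this is p / a
    a = mu2 * ratio - 1 / ratio
    p = a * ratio
    y0 = (n / a) * p / 10 ** exact_log_ratio(p)
    return p, a, y0, 1 / ratio     # mean - mode = a / p


for p in (2.5, 5.0, 10.0):
    print(p, pearson_log_ratio(p) - exact_log_ratio(p))
print(fit_type_iii(1.0, 3.0, 6.0))
```

The sketch fails when μ₃ = 0 (the normal point) and when a = 0 (the exponential point), where the mode falls on the boundary.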




A fitting of the distribution, not involving μ₃, may be
accomplished by utilizing the fact that the difference between
the mean and the mode equals a/p. Determine this distance
by the use of formula [4] or [4-a], thus yielding a/p. The
constants a, p, y₀ are then found as above, completing the
solution.

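Type V. The equation from the boundary as origin is,

\[ y = y_0\, e^{-\gamma/x}\, x^{-p} \]   . . . [80]

\[ p = 4 + \frac{8 + 4\sqrt{4 + \beta_1}}{\beta_1} \]   (Plus sign of radical to be used)

\[ \gamma = \sigma\,(p - 2)\sqrt{p - 3} \]   (Sign of radical is the same as that of μ₃)

\[ y_0 = \frac{N\,\gamma^{\,p-1}}{\Gamma(p - 1)} \]

\[ \text{Distance from origin to mean} = \sigma\sqrt{p - 3} \]

\[ \text{Mean} - \text{Mode} = \frac{2\gamma}{p\,(p - 2)} \]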
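The Type V constants can be sketched in the same way, assuming p = 4 + {8 + 4√(4 + β₁)}/β₁, γ = σ(p − 2)√(p − 3) with the sign of μ₃, and y₀ = Nγ^(p−1)/Γ(p − 1). The function name is hypothetical; the magnitude of γ is used in y₀ so that a negative μ₃ does not produce a complex power.

```python
import math


def fit_type_v(n, mu2, mu3):
    """Return (p, gamma, y0, origin_to_mean, mean_minus_mode)."""
    beta1 = mu3 ** 2 / mu2 ** 3
    p = 4 + (8 + 4 * math.sqrt(4 + beta1)) / beta1
    sigma = math.sqrt(mu2)
    gam = math.copysign(sigma * (p - 2) * math.sqrt(p - 3), mu3)
    y0 = n * abs(gam) ** (p - 1) / math.gamma(p - 1)
    origin_to_mean = sigma * math.sqrt(p - 3)
    mean_minus_mode = 2 * gam / (p * (p - 2))
    return p, gam, y0, origin_to_mean, mean_minus_mode


# Moments of the curve with p = 6 and gamma = 12: mu2 = 3, mu3 = 18.
print(fit_type_v(1.0, 3.0, 18.0))
```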


Section 40. THE BEARING OF CURVE TYPE UPON STABILITY
OF DISTRIBUTION


With the visual pictures of these curve types in mind we 
may proceed to a discussion of the bearing of type upon 
stability of distribution. 

Mention has been made of the fact that the point (β₁ = −4,
β₂ = −3) is a unique point. Every significant line in the
chart, except the line β₁ = 0, passes through this point.
Many interesting relationships are made very clear by shifting
the origin to this point.

The region enclosed within the Type II-VII and the Two-
Category lines corresponds to “real” distributions. A real
distribution, as implied by the steps in the Pearson method, is 
one having the first four moments finite in addition to a finite 
total population. Other features, which one might insist 
should be finite, are not infrequently lacking. All of the u- 
shaped curves which are asymptotic to their upper or lower 
limits have infinite ordinates at these limits, though their 
areas are generally finite. One desirous of defining a real 
distribution in narrower terms than has Pearson would prob- 
ably exclude these. 




In speaking of infinite positive moments, ordinates or popu- 
lations, the reader will of course understand that no obtained 
distribution can possess such a feature. Attempts to fit a 
smooth curve to a distribution more frequently than otherwise 
result in obtaining an equation with some infinite characteristic. 
Accordingly a reference to a distribution with such an infinite 
property is to the fitted curve, and though this infinite feature 
is not characteristic of the specific data in hand, it may be 
entirely descriptive of the total population of which the given 
data are a sample. In dealing with data in which certain 
reciprocal functions are infinite we will likewise be speaking of 
the fitted curves. 

Certain of the Pearson types have infinite characteristics,
ordinates, abscissas, and moments. As “real” distributions
these might be looked upon as shortcomings. The point is,
simply, that different limits as to the extent of distributions
will exist dependent upon what is included in the concept
“distribution.” If negative frequencies are included, and it
is to be hoped that a satisfactory physical meaning can be
given to them so that they may be included,* then the limits
of distributions greatly exceed the region bounded by the
Type VII and the Two-Category lines. On the other hand,
were one to restrict his concept to curves having finite eighth
moments, the critical line (μ₈ = ∞) would be a limit. The
writer would think it logical either to restrict the concept to
such as have all their moments finite, or to throw the field wide
open and include everything which has as much as one
determinable feature, such as the population, any one ordinate,
any one moment, any one derivative, etc.

The acceptance of this broader definition of “distribution”
immediately suggests the study of distributions for the purpose
of ascertaining the nature and number of features which are
finite, i.e. determinable. This has been done with reference
to the moments of the various types of curves with results as
shown in Chart XVIII. If a positive moment (∫ y xⁿ dx) is
finite when taken about a certain point, it continues finite
when taken about any other point a finite distance from the
first. In dealing with negative, or inverse moments
(∫ y x⁻ⁿ dx), however, the point of reference determines whether
it be finite or infinite. The only natural point of reference seems
to be a limit of the distribution. It is found, for all the Pearson
curves, that if μ₋ₙ = ∞, then μ_{−(n+Δ)}, where Δ as well as n
is positive, is of necessity also infinite, so that moments have
been taken around that end of the distribution which shows a
breakdown, or infinite value, in the lower inverse moment
(μ₋₂ is called a lower, or smaller, negative or inverse moment
than μ₋₃, etc.).

* For a suggestion as to this see Chart XIV and discussion of Sec. 8.
The method of determining which are infinite follows from
the fundamental differential equation, which is

\[ \frac{1}{y}\frac{dy}{dx} = \frac{a_1 + a_2 x}{c_1 + c_2 x + c_3 x^{2}} \]   (Pearson’s differential equation for all types of uni-modal distributions) . . . [81]

If the roots of (c₁ + c₂x + c₃x² = 0) are imaginary the limits
of the curve are ±∞, and if they are real the distribution lies
between the values given by the roots. We may illustrate
the method of determination of the moments by means of a
Type I curve. To determine the infinite negative moment we
will first shift the origin to the left extremity of the distribution.
Let

\[ c_3 x^{2} + c_2 x + c_1 = c_3\,(x + b_1)(x - b_2) \]
\[ c = b_1 + b_2 \]
\[ a_2 = b\,c_3 \]
\[ a_1 - a_2 b_1 = a\,c_3 \]
\[ z = x + b_1 \]

Then the equation from the new origin is

\[ \frac{1}{y}\frac{dy}{dz} = \frac{a + bz}{-cz + z^{2}} \]   . . . [82]

and the limits are z = 0, and z = c. Multiplying by zⁿ and
clearing give

\[ \int a\,y\,z^{n}\,dz + \int b\,y\,z^{n+1}\,dz = \int \left(-c\,z^{n+1} + z^{n+2}\right) dy \]

Integrating,

\[ a M_n + b M_{n+1} = \Big[\left(-c\,z^{n+1} + z^{n+2}\right) y\Big]_{z=0}^{z=c} - \int -c\,(n + 1)\,y\,z^{n}\,dz - \int (n + 2)\,y\,z^{n+1}\,dz \]

\[ [a - c\,(n + 1)]\,M_n + (b + n + 2)\,M_{n+1} = \Big[\left(-c\,z^{n+1} + z^{n+2}\right) y\Big]_{z=0}^{z=c} \]   . . . [83]

The M’s or moments of this equation differ from the usual
moments, μ’s, only in that they are not divided by N, the
population. The two terms in the left hand member are
functions of the entire distribution, while the right hand
member is a function of the limits only. Whenever the
coefficient (b + n + 2) equals zero, then M_{n+1} can vary at will
without affecting Mₙ. Therefore that value of n which makes
this coefficient zero locates the moment, M_{n+1}, which becomes
infinite. This is the procedure that could be followed in
finding out where the positive moments break down, but in
dealing with negative moments Mₙ becomes infinite before
M_{n+1}, so that [a − c(n + 1)] is then the coefficient that
concerns us. It remains to express a and c in terms of β₁ and β₂.
Let −a/c = m₁ and b = m₁ + m₂, then the integral of the
differential equation [82] is

\[ y = y_0\, z^{m_1} (c - z)^{m_2} \]   . . . [84]

and the differential equation is,

\[ -c\,(m_1 + n + 1)\,M_n + (m_1 + m_2 + n + 2)\,M_{n+1} = \Big[\left(-c\,z^{n+1} + z^{n+2}\right) y\Big]_{z=0}^{z=c} \]   . . . [83 a]

If the origin is taken at the other boundary the differential
equation is the same as above with m₁ and m₂ interchanged.
The constants for any given distribution, m₁ and m₂, are functions
of β₁ and β₂ (Pearson 1895) and can be expressed concisely if
the following substitutions are made:

γ = β₁ + 4
λ = β₂ + 3
i = λ − γ
j = 5λ − 6γ
k = 3γ − 2λ

The two roots of the following equation give the two values of m,

\[ k\,m = j \pm 3\lambda i \sqrt{\frac{\gamma - 4}{\gamma\lambda^{2} - 36\,i^{2}}} \]   . . . [85]

For the determination of the first inverse moment which breaks
down we are concerned with the value found by using the minus
sign of the radical. Values of m along a ray through the
point (β₁ = −4, β₂ = −3) may be readily determined. For
example, for Type III line, k = 0, and

\[ \beta_1 = \frac{4}{m + 1} \]

For Type XII line, j = 0, and

\[ \beta_1 = \frac{3m^{2}}{1 - m^{2}} \]

For the line 11γ − 8λ = 0,

\[ \beta_1 = \frac{40\,(2m - 7)^{2}}{121\,(8 - m)(m + 1)} \]


As the $M_n$ inverse moment breaks down when $n = -m - 1$,
we may write for the Type III line, $\beta_1 = -4/n$, substitute
$-1$, $-2$, $-3$, etc., values for $n$ and ascertain the $\beta_1$'s or the
points along this line where the successive inverse moments 
become infinite. A similar procedure for other rays enables 
the plotting of the entire region, as shown in Chart XVIII. 
Transferring the origin to the mean, so that positive moments 
will not become infinite merely due to the boundary being an 
infinite distance from the mean, and finding when the coefficient
of $M_{n+1}$ equals zero, gives the limiting values for the
positive moments. These are more simple functions of $\beta_1$
and $\beta_2$, all being straight lines passing through the point
$(\beta_1 = -4,\ \beta_2 = -3)$. Going, on the chart, from below up,
these rays become more and more dense until the limiting
Type III ray is reached; just as, going from above down, the
negative moment-breakdown lines become more and more
dense until the limiting Type V line is reached. Special note
needs to be made of the lines for moments $\mu_0$, $\mu_1$, $\mu_2$, $\mu_3$, and $\mu_4$.
The last three of these moments are incorporated in the very
axes, $\beta_1$ and $\beta_2$, of the chart. Lines determined from the coefficient
of $M_{n+1}$, showing where these moments break down,
would show, as might have been anticipated, that the rays
for $\mu_2$, $\mu_3$ and $\mu_4$ lie outside of the region described by Pearson
as that corresponding to real distributions. The line for
$(\mu_0 = \infty)$ when the coefficient of $M_{n+1}$ is used lies within the
Pearson possible region, and the line for $(\mu_1 = \infty)$ lies at the
boundary of it. The population, $\mu_0$, is not necessary to the
calculation of $\beta_1$ and $\beta_2$ so that the fact that it lies within this
region is not inconsistent with the definition of the axes.
However, $\mu_0$ and $\mu_1$ are smaller moments than those involved
in $\beta_1$ and $\beta_2$ and it may be necessary to determine their points
of breakdown from the coefficient of $M_n$ and not of $M_{n+1}$.




Pending further study of Type I-u distributions I will not 
attempt an answer to this question or a description of distribu- 
tions having infinite zero and first moments. 

If the coefficient of $M_{n+1}$ is examined with reference to the
negative values of $n$ for which it becomes zero, rays above
Type III are located and these become more and more dense
as Type III is approached. These have not been plotted, as
earlier points of breakdown of the negative moments are located
by dealing with the coefficient of $M_n$, but it is worth while
noting that, judging by the coefficient of $M_{n+1}$, Type III distributions
are the only ones which do not possess certain infinite
positive or negative moments, i.e., certain elements of instability.
If these unplotted lines should prove of any significance,
Type III distributions become unique not only because
of possessing finite positive moments, but also because of the
finite nature of whatever the inverse functions are whose
points of breakdown are given by the coefficient of $M_{n+1}$.
If, then, finite positive moments are of most importance, Type III
is the most stable of all the types; however, should finite
negative moments be of greater importance than positive,
Type V would be the most stable; and if the possession of
both finite positive and negative moments is material, then the
normal distribution is the most stable curve within all the
types.

It has for some time been known (Pearson, 1905), that if, 
by means of the first four moments, a curve is fitted to a 
distribution having a $(\beta_1, \beta_2)$ point in region VI or IV, certain
of the higher moments of the fitted curve are infinite. Pearson 
and Rhind (1909, pp. 130 and 134) have apparently interpreted 
this to mean that for such distributions moments higher 
than the fourth are needed for an adequate description of the 
data. This, however, hardly seems to me the most significant 
point of view. We can adequately and completely describe 
the sample collected by calculating and recording enough of 
the higher moments, but as Pearson has himself pointed out, 
this would scarcely yield valuable information as to the popula- 
tion of which the data are a sample because the probable 
errors of these higher moments become extreme. The really 
important conclusion to draw is that data, such that the 




sample drawn gives a $(\beta_1, \beta_2)$ point in the Type VI or IV
regions, are of such a nature as to have indeterminate higher 
positive moments. The lines labeled $\mu_5 = \infty$, $\mu_6 = \infty$, $\cdots$, $\mu_{-2} = \infty$,
$\mu_{-3} = \infty$, etc. on Chart XVIII indicate where, judged
by the first four moments, these higher positive and negative
moments become infinite. Suppose that for a given $(\beta_1, \beta_2)$ a
fitted distribution is obtained for which some high moment $\mu_n = \infty$. Such
analysis as I have been able to make leads me to infer that a
few added moments in the fitting of the curve would not be
expected to materially change this, and that some moment
not far from $\mu_n$ will break down in any case.

These phenomena of instability of certain types of distribu- 
tions are not mere oddities of the equations representing the 
types. Either coefficient of the difference equation connecting 
the moments may be written in the form, 


$$\phi(\beta_1, \beta_2)\, n + f(\beta_1, \beta_2) = 0$$


in which $\phi$ and $f$ are definite functions of the $\beta$'s. Accordingly
the breakdown of a moment is a function only of the moments
involved in the $\beta$'s. In other words, were we to fit a Type I
curve and find that the n-th positive or negative moment 
became infinite, we could not improve the situation by fitting 
a Type II curve to the same data. The breakdown is not a 
function of the particular Pearson type chosen, but of the 
data, or of the differential equation back of all the Pearson 
types. That it is hardly the latter may be shown. 
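The statement that breakdown depends only on the $\beta$'s can be illustrated with a short computation. The sketch below rests on an assumption not spelled out in the text: the standard recursion for moments about the mean of a Pearson curve, in which the coefficient of the moment of order $n + 1$ is $1 - (n + 2)c_2$ with $c_2 = (2\beta_2 - 3\beta_1 - 6)/(10\beta_2 - 12\beta_1 - 18)$. Setting that coefficient to zero gives a condition linear in $n$, of the form shown above. The function name is hypothetical.

```python
def first_infinite_moment_order(beta1, beta2):
    """Order at which positive moments about the mean cease to be finite.

    Assumption: the coefficient of the moment of order n + 1 in the
    Pearson moment recursion is 1 - (n + 2) * c2, where
    c2 = (2*beta2 - 3*beta1 - 6) / (10*beta2 - 12*beta1 - 18).
    Only points beyond the Type III line (2*beta2 - 3*beta1 - 6 > 0)
    are treated; elsewhere None is returned.
    """
    den = 2 * beta2 - 3 * beta1 - 6
    if den <= 0:
        # On or below the Type III line no positive moment breaks down
        # by this criterion.
        return None
    num = 10 * beta2 - 12 * beta1 - 18
    # Solve (n + 2) * den = num for n, then report the order n + 1.
    return num / den - 1


# Student's t with 10 degrees of freedom has beta1 = 0 and beta2 = 4.
order_t10 = first_infinite_moment_order(0, 4)
```

For the symmetric case just shown the result agrees with the known behaviour of Student's distribution, whose moments exist only below the order equal to its degrees of freedom. The result depends on the data solely through $\beta_1$ and $\beta_2$, whichever type name the fitted curve carries.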

Had Pearson decided to use the first five moments in fitting 
curves it would have involved, in addition to the usual $\beta_1$
and $\beta_2$ constants, a third which we may call $\gamma$. A solid having
three axes, $\beta_1$, $\beta_2$ and $\gamma$, would represent all the types just as
the plane with axes $\beta_1$, $\beta_2$ now represents them all. The most
serviceable function to constitute the third variable $\gamma$ is not
immediately obvious, but there would be certain advantages
in defining $\gamma$ as the difference between the $\beta_3$ $(\beta_3 = \mu_3\mu_5/\mu_2^4)$
given by the data and that derived from moments lower than
the fifth by means of the present differential formula [81].
When so defined, if $\gamma = 0$ a distribution would be represented
by a point on the two-dimensional $(\beta_1, \beta_2)$ chart. It is barely
conceivable that there might be a $(\beta_1, \beta_2, \gamma)$ line for which all




the positive and negative moments are finite. If there is such 
a line it cuts the $(\beta_1, \beta_2)$ plane in the Normal point and nowhere
else, so that the normal distribution loses none of its peculiar
stability. The existence of such a line seems unlikely in view
of the fact that there is no line (as opposed to point) in the
$(\beta_1, \beta_2)$ plane for which all the moments are finite. Otherwise
expressed, had two moments only been used to derive the 
equations of curves, the special points on the chart could have 
been found and the normal distribution would have been the 
only one having all its moments finite. Had three moments 
been used the special lines in the chart could have been found, 
but no line would represent distributions having all their 
moments finite, the single Normal point again possessing this 
characteristic. Again, by the use of four moments, no area, no 
line, but merely the one Normal point is found for which all 
the positive and negative moments are finite. Accordingly it 
seems unlikely that the addition of a fifth moment would result 
in any extension of the distributions having all their moments 
finite. 

The preceding discussion suggests that it would be futile to
add an $x^3$ term in the denominator of the differential equation,

$$\frac{dy}{y\,dx} = \frac{a_1 + a_2 x}{c_1 + c_2 x + c_3 x^2}$$


The addition of an $x^2$ term in the numerator introduces
bimodality and carries the problem into an entirely different
field, corresponding, in all probability, to the operation of two 
opposing trends, instead of a single one such as we are here 
considering. 

The only conclusion which seems to me to follow from the 
situation as described is that the weakness in distributions, 
evidenced by the existence of certain infinite moments in the 
fitted curves, lies in the data. This far reaching conclusion 
is supported by (1) the fact that an extension of the differential 
equation to include additional moments will, apparently, some- 
times change, but not materially better the situation; and (2) 
by the known illustrations of instability which may be drawn 
from economic, psychologic and biologic fields. 




Section 41. ILLUSTRATIONS OF UNSTABLE DISTRIBUTIONS 


Two distributions have come to my attention which are 
difficult to interpret, except as being unstable Type VI distribu- 
tions. 

The first is of price ratios, see Chart VIII, each ratio 
being the quotient of a price in a certain year divided by the 
price of the same commodity the preceding year. The distribution
is very peaked and somewhat skewed and gives a $(\beta_1, \beta_2)$
point so far down the chart that the fourth moment has an 
infinite probable error when the differential equation method 
of determining it is followed. The apparently puzzling ques- 
tion is how the curve fitting method can be so far wrong as to 
positively describe this distribution as one having an infinite 
feature. Recent study of similar price data shows that the 
fitted curve was undoubtedly correct and that the data did 
actually have such an infinite characteristic. Certain com- 
modities for sale in 1917 were not purchasable at any price in 
1918 and the series of 1918 ratios covered only such 1917 
commodities as could be purchased in 1918. In other words, 
such price ratios as were recorded were in truth but a part of 
an unstable distribution, and being such they gave evidence 
that an occasional infinite price ratio was to be expected. 

The second series is such as may be collected by any experi- 
menter. A certain student was a subject in a reaction time 
experiment. The stimulus consisted of a spoken word and the 
reagent was directed to reply with the first word coming to 
mind. The series of reaction times revealed a Type VI distri- 
bution with a fourth moment having an infinite probable error 
when determined from the differential equation. This reagent 
was not tested further, but other reagents have been, with the 
result that a mental confusion or blocking has been found to 
occasionally occur, and to be so pronounced that the reagent 
has refused to react at all, i.e., the reaction time for that 
particular stimulus has become infinite. I have no doubt that 
were it possible, without changing the conditions, to continue 
the experiment with the first subject, sooner or later a similar 
blocking would be found, so that here again the probability 


is that the infinite higher moment is a true description of the 
situation. 




According to Angell (1907), who points out that judgments 
of equality between two differing stimuli cease to constitute a 
homogeneous series if the stimuli differ by too great an amount, 
the same sort of condition holds generally in psychological 
threshold experiments. That is to say that reactions from such 
widely differing stimuli will yield distributions having unstable 
tails, or, what I would take as the statistical equivalent, Type 
IV or Type VI distributions. The use of the curve fitting 
method to determine the degree and nature of the instability 
in threshold experiments is suggested, but it suffices for our 
immediate purposes to note that psychologic as well as economic 
data occasionally yield distributions actually possessed of un- 
stable tail functions, or in other words, infinite positive moments. 

These illustrations point the possibility of the existence of a 
causal relationship which is determinable from a knowledge 
of the positive, and probably also negative, moments which 
become infinite. In fact, the order of the breakdown moments 
may prove a touchstone to the discovery of causal relation- 
ships. The method at present available for locating these 
critical moments is that of utilizing the first four positive 
moments from the actual data to determine a differential 
equation connecting moments. Having this equation the 
critical moments may be located immediately. 

Slight shifting of the origin entirely changes the situation 
with reference to the inverse moments, so that, (a) it is either 
impossible to utilize inverse moments, (b) the conditions of 
the problem must give the limit with absolute definiteness, 
or (c) more definite features, such as the positive moments, 
must be used for the indirect determination of the limits and 
of the inverse moments around these limits. That method 
(c) will result in determinations with relatively small probable 
errors in case the lower negative moments are the critical 
ones is apparent from the appreciable distances apart of the 
$\mu_{-n}$ lines of Chart XVIII.

Though the laws controlling biologic phenomena have proven 
less easily and definitely determinable than many of those of
physics, nevertheless the distributions of traits resulting from 
biological forces can readily be determined and examined. Is 
it not reasonable to think that, whatever else evolution may 




involve it certainly involves a trend toward stability? If it 
is a development through laws represented by positive mo- 
ments, its limit is a Type III distribution; and if through laws 
represented by inverse moments, its limit is a Type V distri- 
bution; and if both are involved, the only final limit is the 
normal distribution. This approach may be peculiarly valu- 
able in studying evolution and it should not be a difficult matter 
to test it. Distributions of shell and skeletal structure of past 
ages can be made. Should it prove a fact that forms existent 
in the past giving distributions different from Types III and 
V have disappeared, and that those close to these Types are 
still represented by extant life, it would be complete support 
of this point. 

We may note that the peculiar stability of Type III as 
judged by the existence of determinable positive moments is in 
harmony with the unique facts of correlation which Pearson 
has pointed out as belonging to this type. This is the only 
type in which “each contributory cause group is of equal
valency and independent.” The writer may have overlooked,
but at least he has not found, in Pearson’s contributions a
satisfactory explanation and elaboration of “cause groups.”
He, however, interprets them as analogous to separate chromo- 
somes, each of which may affect a single character, or to separate 
climatic and economic conditions each of which may affect 
a given food product, etc. If cause groups are not independent, 
so that a measure of a certain magnitude implies other magni- 
tudes positively correlated with it, we have a situation which, 
from a priori considerations, one would expect to correspond 
to a trend, or tendency operating to pull measures in a certain 
direction, possibly entirely out of the distribution. It may be 
that a sufficient number of counteracting pulls, or vectors, 
could exactly balance each other, resulting in a condition 
identical with one not involving any pulls whatsoever, so that 
it seems equally reasonable to look upon Type III distributions 
as those in which there is a perfect balance between positive 
and negative correlation tendencies, thus revealing a zero 
correlation, or as distributions in which the pulls between 
elements are all zero. Whichever view is taken the significant 
result remains the same; that distributions which differ from 




Type III thereby give evidence of the existence of uncompen- 
sated correlation between cause groups,—and of lack of 
stability since certain moments are indeterminate. 

The determination of the specific nature of the correlation 
between cause groups in Type V distributions is a promising 
field of research. This type, holding as it does the same 
position with reference to stability of negative moments that 
Type III holds with reference to stability of positive moments, 
may possess some equally unique and stable characteristic 
with reference to negative product-moments as that possessed 
by Type III with reference to positive product-moments. 

In the light of all the facts presented it would seem that 
evolution must be a trend toward the normal distribution. 
Also, dependent upon the causal forces operating, it would 
seem that subsidiary trends would be toward the three lines 
running into the normal point. If the causal forces can be 
expressed as positive moments, changes in distributions below 
Type III in the direction of Type III would mean ever greater 
stability, i.e., evolution. If the causal forces can be expressed 
as negative moments, changes in distributions above Type V 
in the direction of Type V would mean evolution. Balanced 
or symmetrical distributions show a peculiar stability in that 
all odd moments are zero. If stability of this type is the goal 
of a certain line of evolution, the trend would be toward Type 
II or Type VII. Finally, a certain development (biologic, 
economic, psychologic, or what not) having reached one of 
the three subsidiary goals, Type II or VII, Type III, or Type V, 
further advance, to insure stability of a still greater order, 
would be along the line toward the normal point. 

The possession by an individual of a trait of such magnitude 
as to lie outside of the distribution given by the other members 
of the species ordinarily carries with it the elimination by 
death of the individual,* hence stability in trait is intimately
connected with stability in species. 

Only in case a trait is operated upon by such influences as
result in the measures of the trait falling into a normal distribu- 
tion can it be said that there is complete stability, or that the 


* Cf. the traits possessed by lethal Drosophila melanogaster.




race or species possessing it gives evidence of a self-contained 
permanence. 

Clearly if this analysis is correct, the evolution of a bisexual
type of life would be as follows: (1) two entirely distinct traits 
which we may call male and female; (2) an occasional modifi- 
cation of the two, each in the direction of the other, giving 
a u-shaped distribution; (3) a building up of a common ground 
between the extremes, giving a limited range Type II-i distri- 
bution; and (4) a further weakening of the extreme character- 
istics until they become of infinitesimal importance in com- 
parison with the common ground between, resulting in a 
normal distribution. 

Following the lead of the argument we find the human species 
much further developed in certain parts of its makeup than in 
others. As illustrations of the four stages note (1) primary 
sex characteristics; (2) secondary sex characteristics; (3) mus- 
culature; (4) intelligence. In concluding this chapter let me 
emphasize the promise that lies in an experimental study of 
evolution, utilizing the facts of distribution types. 


CHAPTER VIII
MEASURES OF RELATIONSHIP 


Section 42. THE PROBLEM OF CONCOMITANT VARIATION IN
THE SCIENCES 


The determination of the law underlying concomitant varia- 
tion is a problem common to all the sciences. The physical 
sciences have a great advantage over the social and biological 
sciences in that (1) errors of observation and measurement are 
usually very small in comparison with the measures involved 
and (2) fewer factors are ordinarily present. In measuring 
some intellectual capacity of a group of children, it usually 
happens that the probable errors of the test scores obtained 
are greater than half the standard deviation of the scores of 
the group. Obviously any relationship between two capacities, 
each measured with no greater reliability than this, will be 
clouded by the errors of measurement. This is serious enough,
but it is not the only difficulty. In measuring the effect of 
gravity, physicists can ordinarily assume that ten pounds of 
lead and ten pounds of iron will act in a similar manner. But 
in measuring intellect, food prices, etc., to say that one reagent, 
one commodity, etc., is equivalent to another with respect to 
the function being examined is usually questionable. Accordingly,
where the investigations of physics lead to the establishment
of “laws,” those of the social sciences ordinarily lead
to the discovery of “tendencies.” Relationships between two
psychological, biological or social factors frequently depend 
upon a number of causes, each more or less independent, and 
no one of which is so important as to dominate the situation. 
Under these conditions, the relationship tends to be rectilinear. 
In other cases, where the true relationship is not rectilinear, 
large errors of measurement will lessen the strength of the 





measurable relationship, thereby making it more difficult to 
determine the exact nature of whatever curvilinear relation- 
ship may exist. It is also true that relationships which are 
intrinsically curvilinear when determined over a range of the 
two variables from very low to very high, may show practically 
rectilinear relationship throughout a short stretch of the range. 
For all the reasons stated, a measure of relationship based upon 
the assumption of rectilinearity is of great importance. Even 
in the case of known non-rectilinear relationship it is of much 
value as a point of departure. The balance of this chapter is 
devoted to a discussion of Pearson’s product-moment coefficient
of correlation, the “best” measure of mutual implication,
if relationships are rectilinear. 

The most fundamental properties of this measure of relation- 
ship were discovered and presented graphically by Francis 
Galton from 1877 to 1888. Galton’s investigations had to do 
with the inheritance of traits, and certain of the terms which 
he used would hardly have arisen if the development had 
involved other data. For example, the symbol “r” was a
measure of the “reversion,” such, for example, as offspring
upon mid-parent (a mid-parent measure is the average of the
measures of father and mother). Later, Galton used the
terms “regression” and “co-relation” and called the measure
the “Index of Co-relation.” Weldon very properly calls this
measure “Galton’s Function” and Edgeworth in 1892 gave it
the name which has survived, “Coefficient of Correlation.”
Pearson (1920 notes) has pointed out that the product- 
moment function of Bravais bears but a resemblance in form 
to the product-moment coefficient of correlation. Whereas 
Bravais started with observations which were assumed to be 
independent, and in treating them obtained derived measures 
whose product-moments did not equal zero, Galton started 
with the epoch-making concept that the original measures 
were dependent. The Bravais treatment leads nowhere so 
far as correlation theory is concerned, because the measures 
which are correlated do not constitute original data, nor 
functions the correlations between which are of any moment 
on their own account. Partial correlation analysis leads to 
independent measures, having given related original scores; 




which is exactly the reverse of the Bravais or Gaussian develop- 
ments. Galton alone seems deserving of being called the 
father of correlation. 


Section 43. FINDINGS RESULTING FROM GALTON’S
GRAPHIC TREATMENT 


Galton’s procedure, based upon medians and quartile deviations,
has given way to the more accurate one involving the
product-moment formula,

$$r = \frac{\Sigma xy}{N \sigma_1 \sigma_2}$$

developed by Pearson.
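As an illustration only, the product-moment formula can be written out in a few lines. The sketch assumes that $x$ and $y$ are deviations from their respective means and that the standard deviations use the divisor $N$; the function name and the sample figures are hypothetical.

```python
import math


def product_moment_r(xs, ys):
    """Pearson's r = sum(x*y) / (N * sigma_1 * sigma_2), x and y being deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dev_x = [x - mean_x for x in xs]   # deviations from the mean of the first variable
    dev_y = [y - mean_y for y in ys]   # deviations from the mean of the second variable
    sigma_1 = math.sqrt(sum(d * d for d in dev_x) / n)
    sigma_2 = math.sqrt(sum(d * d for d in dev_y) / n)
    return sum(a * b for a, b in zip(dev_x, dev_y)) / (n * sigma_1 * sigma_2)


# Hypothetical heights in inches: mid-parents and their adult offspring.
mid_parents = [66.0, 67.5, 68.0, 69.5, 71.0]
offspring = [67.0, 67.0, 68.5, 69.0, 70.0]
r = product_moment_r(mid_parents, offspring)
```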

We cannot do better than to use Galton’s data in deriving a 
measure of correlation. Galton obtained the heights of parents 
and the heights of children, and drew up a “correlation table”
or “scatter diagram” showing the relationship between the 
two. All female heights were multiplied by 1.08 to make them 
comparable with male heights. This procedure is not the 
most sound, but in this problem leads to no material error. 
Letting $X_1$, $X_2$ represent male and female heights, $\sigma_1$, $\sigma_2$ their
standard deviations and $M_1$, $M_2$ their means, it would have
been better to have reduced each female height to a comparable
male height by the equation

Comparable male height $= M_1 + (X_2 - M_2)\,\sigma_1/\sigma_2$

The discussion which follows will assume that the more reliable 
method of transmuting female into male heights was followed 
and also that the mean was used throughout. Presumably 
Galton used the median, but no fundamental difference in 
treatment followed from such use, it simply being a slightly 
less reliable procedure. Galton’s diagram contained the data 
given in the accompanying correlation table or scatter diagram, 
Chart XX. Deviations being measured from 68¼ inches, which
is a small fraction of an inch away from the true means, are
labeled $\xi$ and $\eta$ instead of $x$ and $y$, but no account of this slight
difference is taken until the calculation of Section 45. From
just such data as given, in fact it is likely that these identical 
data were involved, Galton inferred certain relationships which 
we now know hold with every normal correlation surface 
[Formula 87]. 
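The transmutation of female into comparable male heights by means and standard deviations, preferred above to the flat multiplier 1.08, may be sketched as follows. The function name and the sample heights are hypothetical, and the standard deviations are taken with divisor $N$.

```python
import statistics


def to_comparable_male(female_heights, male_heights):
    """Apply M1 + (X2 - M2) * sigma1 / sigma2 to every female height X2."""
    m1 = statistics.mean(male_heights)
    m2 = statistics.mean(female_heights)
    sigma1 = statistics.pstdev(male_heights)     # population standard deviation, males
    sigma2 = statistics.pstdev(female_heights)   # population standard deviation, females
    return [m1 + (x2 - m2) * sigma1 / sigma2 for x2 in female_heights]


# Hypothetical heights in inches.
males = [66.0, 68.0, 70.0, 72.0]
females = [61.0, 63.0, 65.0]
comparable = to_comparable_male(females, males)
```

By construction the converted heights have the same mean and standard deviation as the male heights, which is the sense in which they are comparable.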




(a) A plot of the means of the vertical arrays (columns) as 
shown by the X’s shows the “reversion” of offspring upon height 
of mid-parent. Thus if the mid-parent height is 23 inches 
above the mean the average or most probable height of offspring 
is 14 inches above the mean. 

(b) The line connecting these means may be closely repre- 
sented by a straight line through the origin or intersection of 
the means of the two distributions. This is the line showing 
the regression (or “reversion”) of offspring upon mid-parent.


CHART XX

Correlation Between Heights of Mid-parent and Offspring

[Scatter diagram: heights of adult children and of mid-parents, expressed as deviations from the mean height, 68¼ inches.]



(c) There is a reversion or regression of mid-parent upon 
offspring. This would be represented by a straight line pass- 
ing approximately through the o’s. Thus for every correla- 
tion table there are two regression lines. 


(d) The slopes of these two lines are equal, provided the 
standard deviations of the two distributions are equal. 




(e) If standard deviations are equal, this slope varies between 
zero and one (Galton did not suggest the existence of negative 
correlations), and may be represented by the symbol “r”.

(f) The standard deviations of the measures found in any 
one array (row or column) are approximately equal and are 
smaller than the standard deviations of the total distribution 
so that if $\sigma_2$ equals the standard deviation of the heights of
offspring, and $\sigma_{2.1}$ the standard deviation of offspring corresponding
to given heights of mid-parent, then

$$\sigma_{2.1}^2 = \sigma_2^2\,(1 - \lambda)$$

where $\lambda$ is a positive quantity, also dealing with columns
instead of rows,

$$\sigma_{1.2}^2 = \sigma_1^2\,(1 - \lambda)$$

in which $\lambda$ is the same as before, $\sigma_1$ the standard deviation of
heights of mid-parents, and $\sigma_{1.2}$ the standard deviation of
heights of mid-parents corresponding to given heights of
offspring. The symbol $\sigma_a$ will, in subsequent formulas, stand
for the standard deviation of an array around its own mean
and $\sigma_{1.2}$ (or $\sigma_{2.1}$) the standard deviation of an array around
the regression line, but as we are here dealing with homoscedastic
rectilinear regression either symbol can be used, as
$\sigma_a = \sigma_{1.2}$.


(g) There is a simple relation between $\lambda$ and $r$. It is, $\lambda = r^2$, so that

$$\sigma_{2.1}^2 = \sigma_2^2\,(1 - r^2)$$

and

$$\sigma_{1.2}^2 = \sigma_1^2\,(1 - r^2)$$

(Standard deviation of arrays from regression line, see also Section 48) ......... [86]


(h) Each array is approximately a normal distribution if 
the total distributions are normal. 

(i) If contour lines for different frequencies are drawn in
the diagram they constitute a system of similar and similarly 
placed ellipses, the conjugate diameters of which are the two 
regression lines. 

Galton made no claim to mathematical ability but through 
sheer insight into the phenomena of mutual implication made 
these penetrating observations. He carried his conclusions, 
stated in probability terms, as to the nature of the correlation 




surface, to Mr. J. D. Hamilton Dickson (1886), a mathemati- 
cian, who readily wrote down the normal correlation equation 
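involving two variables. In our present notation this is:

$$z = \frac{N}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}}\; e^{-\frac{1}{2(1 - r^2)}\left(\frac{x^2}{\sigma_1^2} + \frac{y^2}{\sigma_2^2} - \frac{2rxy}{\sigma_1\sigma_2}\right)}$$

(Normal correlation surface: 2 variables) .... [see 88]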
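For readers who wish to evaluate the surface at particular points, a minimal sketch follows. It assumes the usual form of the two-variable normal correlation surface with $x$ and $y$ measured as deviations from their means; the function name and argument names are hypothetical.

```python
import math


def normal_correlation_surface(x, y, sigma1, sigma2, r, n=1.0):
    """Ordinate of the two-variable normal correlation surface at (x, y).

    x and y are deviations from their means; n is the population N,
    so that n = 1 gives a probability density.
    """
    one_minus_r2 = 1.0 - r * r
    # Quadratic form in the exponent; constant values of q trace the
    # similar and similarly placed ellipses of finding (i).
    q = (x * x / sigma1 ** 2
         + y * y / sigma2 ** 2
         - 2.0 * r * x * y / (sigma1 * sigma2)) / one_minus_r2
    return n * math.exp(-q / 2.0) / (2.0 * math.pi * sigma1 * sigma2 * math.sqrt(one_minus_r2))


# Hypothetical constants: equal standard deviations and r = 0.5.
peak = normal_correlation_surface(0.0, 0.0, 1.7, 1.7, 0.5)
```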


Galton’s humility, after years of collection of data and subtle 
analysis of the same, in the face of the neat but not involved 
mathematical derivation, is worthy of note by the social 
scientists of this day who scoff at mathematical analysis. 
Upon receiving from Dr. Dickson the solution of his problem 
he wrote (quoted in Pearson 1920 notes), “I may be permitted
to say that I never felt such a glow of loyalty and respect 
towards the sovereignty and magnificent sway of mathematical 
analysis as when his answer reached me, confirming, by purely 
mathematical reasoning, my various and laborious statistical 
conclusions with far more minuteness than I had dared to hope, 
for the original data ran somewhat roughly, and I had to 
smooth them with tender caution.” 


Section 44. ALGEBRAIC STATEMENT OF GALTON’S GRAPHIC 
FINDINGS AND DERIVATION OF CORRELATION FORMULAS 


Let us consider these discoveries in more detail. Let x, 
the first variable, stand for height of mid-parent, y height of 
offspring, each expressed as a deviation from its respective 
mean. The standard deviations are respectively $\sigma_1$ and $\sigma_2$,
while $r$ is the slope of the regression line in the “reduced”
scatter diagram, that is, in the correlation table in which
the measures entered are $x/\sigma_1$ and $y/\sigma_2$ respectively. Galton
reduced by dividing by the quartiles, leading to essentially 
the same result as here. The slopes of the regression lines are 
equal, and equal to r. We will shortly obtain the numerical 
value of r by other than the graphic method of Galton. Finally, 
let $\bar{y}$ stand for an estimated height of offspring, knowing the
height of mid-parent, and $\bar{x}$ the estimated height of mid-parent
knowing height of offspring. With this notation, discoveries
(a) and (b) together are equivalent to the equation

$$\frac{\bar{y}}{\sigma_2} = r\,\frac{x}{\sigma_1}$$

(Fundamental form of regression equation) ........ [see 91]

Propositions (c) and (d) are equivalent to the addition of the
following equation to the preceding

$$\frac{\bar{x}}{\sigma_1} = r\,\frac{y}{\sigma_2}$$


These are the two fundamental regression equations char- 
acteristic of every regression table, showing rectilinear regres- 
sion. 
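The two regression equations may be put side by side in a short sketch. It assumes rectilinear regression with all measures expressed as deviations from their means; the function name and the numerical constants are hypothetical.

```python
def regression_estimates(x, y, sigma1, sigma2, r):
    """Return (estimated y for the given x, estimated x for the given y).

    All quantities are deviations from their respective means.
    """
    y_est = r * (sigma2 / sigma1) * x   # regression of offspring upon mid-parent
    x_est = r * (sigma1 / sigma2) * y   # regression of mid-parent upon offspring
    return y_est, x_est


# Hypothetical constants: equal standard deviations and r = 0.5.
y_est, x_est = regression_estimates(2.0, 2.0, 1.8, 1.8, 0.5)
```

With equal standard deviations both slopes equal $r$, and in every case the product of the two slopes is $r^2$, so that neither line is the algebraic inverse of the other unless $r = 1$.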

Proposition (e) is liable to misinterpretation. If r = 0, it 
implies that there is no relationship, no reversion or regression 
of one variable upon the other while an r = 1 means complete 
mutual implication of the two variables. More loosely stated, 
this latter situation will be described as one of complete mutual 
dependence, or simply dependence of the two variables. The 
student, however, should not postulate causal dependence. 
So far as data are concerned there is no evidence that the 
heights of the parents have any more to do in causing the 
heights of the offspring than do the heights of the offspring in
causing the heights of the parents. This is characteristic of 
all measures of correlation. A situation exists and a correla- 
tion coefficient measures the tendency of the pairs of measures 
to be related but gives no evidence whether x is the cause of 
y, y the cause of x, or whether the cause is unknown and lies 
back of both. We think of parents being causal agents in 
determining the heights of offspring, but we do this for reasons 
outside of the scatter diagram, namely, the parents have 
existed earlier than the offspring in a time series. 

Propositions (f) and (g) are of course the result of careful collection and study of data, but Galton gave a very simple proof of (g). The variability of the offspring generation is determined by the variability of the arrays (rows) and the variability of the means of these arrays. If $\Delta$ equals the distance of the mean of an array from the mean of the distribution and, as before, $\sigma_{2.1}$ equals the standard deviation of the array, and if $n$ equals the number of measures found in an array, then $(n\sigma_{2.1}^2 + n\Delta^2)$ equals the contribution of a single array in the calculation of the standard deviation of the distribution, thus:

$$\sigma_2^2 = \frac{\sum n\sigma_{2.1}^2 + \sum n\Delta^2}{N}$$




Since $\sum n\sigma_{2.1}^2 = N\sigma_{2.1}^2$ and since for any array $\Delta$ equals the estimated $y$ corresponding to the given value of $x$, so that $\sum n\Delta^2 = \sum n\bar{y}^2 = N\sigma_{\bar{y}}^2$, therefore

$$\sigma_2^2 = \sigma_{2.1}^2 + \sigma_{\bar{y}}^2$$
(Standard deviation of distribution in terms of standard deviation of arrays and of standard deviation of means of arrays: rectilinear regression) [87]


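By proposition or discovery (f)

$$\frac{1}{N}\sum n\sigma_{2.1}^2 = \sigma_{2.1}^2 = \sigma_2^2(1 - \lambda)$$

and

$$\frac{\sum n\Delta^2}{N} = r^2\,\frac{\sigma_2^2}{\sigma_1^2}\,\frac{\sum x^2}{N} = r^2\sigma_2^2$$

Accordingly,

$$\sigma_2^2 = \sigma_2^2(1 - \lambda) + r^2\sigma_2^2$$

and finally,

$$\lambda = r^2$$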
so that the important proposition (g) is established even before 
a formula for the arithmetical calculation of r is at hand. 


(h) is an experimental finding which, coupled with (g) and (a), (b), (c) and (d), immediately gives the equation of the normal correlation surface. The equation, from the mean, of the normal distribution is,

$$z' = \frac{1}{\sigma_1\sqrt{2\pi}}\,e^{-\frac{x^2}{2\sigma_1^2}}$$

If the distribution of an array is normal, its standard deviation $= \sigma_2\sqrt{1 - r^2}$, and if its mean is $\Delta\ (= r\frac{\sigma_2}{\sigma_1}x)$ from the mean of the total population, then the equation of the normal distribution representing the array, from the mean of the entire distribution as origin, is,

$$z'' = \frac{1}{\sigma_2\sqrt{1 - r^2}\,\sqrt{2\pi}}\,e^{-\frac{\left(y - r\frac{\sigma_2}{\sigma_1}x\right)^2}{2\sigma_2^2(1 - r^2)}}$$

The $z''$ corresponding to an assigned $y$ is the probability of a measure in this array having the value $y$. The probability of a measure being in this particular $x$-array is $z'$.



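Therefore, the probability of a measure having the particular value $x$ and also the particular value $y$ is the product of these two probabilities, $z = z'z''$,

$$z = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}}\,\exp\left[-\frac{x^2}{2\sigma_1^2} - \frac{y^2 - 2xy\,r\,\sigma_2/\sigma_1 + x^2r^2\sigma_2^2/\sigma_1^2}{2\sigma_2^2(1 - r^2)}\right]$$

which simplifies to,

$$z = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}}\,\exp\left[-\frac{1}{2(1 - r^2)}\left(\frac{x^2}{\sigma_1^2} + \frac{y^2}{\sigma_2^2} - \frac{2rxy}{\sigma_1\sigma_2}\right)\right]$$
(Normal correlation surface, 2 variables) [88]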
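The simplification leading to [88] can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the trial point are illustrative assumptions, and the standard deviations and correlation are the Galton values quoted in Section 45.

```python
import math

def z_prime(x, s1):
    """Ordinate of the normal curve for x (standard deviation s1)."""
    return math.exp(-x * x / (2 * s1 * s1)) / (s1 * math.sqrt(2 * math.pi))

def z_double_prime(x, y, s1, s2, r):
    """Ordinate of the normal curve for y within the array at x."""
    sd = s2 * math.sqrt(1 - r * r)   # standard deviation of the array
    mean = r * s2 / s1 * x           # mean of the array
    return math.exp(-(y - mean) ** 2 / (2 * sd * sd)) / (sd * math.sqrt(2 * math.pi))

def z_surface(x, y, s1, s2, r):
    """Formula [88]: the normal correlation surface for a population of one."""
    q = x * x / s1 ** 2 + y * y / s2 ** 2 - 2 * r * x * y / (s1 * s2)
    norm = 2 * math.pi * s1 * s2 * math.sqrt(1 - r * r)
    return math.exp(-q / (2 * (1 - r * r))) / norm

# Hypothetical trial point (x, y) in inches from the means.
product = z_prime(1.0, 1.454) * z_double_prime(1.0, -0.5, 1.454, 2.208, 0.3888)
surface = z_surface(1.0, -0.5, 1.454, 2.208, 0.3888)
```

The two quantities `product` and `surface` should agree to within rounding error, which is all the algebraic reduction asserts.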
This is the equation of the normal probability correlation 
surface of two variables and of a total population of one. If 
the right hand member is multiplied by N, we have the equa- 
tion in case the total population is N. The quantity r has to 
this point been defined as the slope of the regression line in 
case standard measures [see formula 65] are the measures 
entered in the correlation table. We will now prove that in any scatter diagram, the two “best fit” rectilinear regression lines are

$$\frac{\bar{x}}{\sigma_1} = r\,\frac{y}{\sigma_2} \quad\text{and}\quad \frac{\bar{y}}{\sigma_2} = r\,\frac{x}{\sigma_1}$$
[see 91]


in which the two $r$'s are identical and given by the equation,

$$r = \frac{\sum xy}{\sqrt{\sum x^2\,\sum y^2}} = \frac{\sum xy}{N\sigma_1\sigma_2}$$
[see 90]

The term “best fit” is used as in the method of least squares. A “best fit” determination is one in which the sum of the squares of the errors of estimate is a minimum, that is, the standard error of estimate is a minimum. Determinations can be made resulting in the sum of the deviations, of the cubes of deviations, of their fourth powers, etc., being a minimum, but since the days of Gauss, it has been known that in the case of a normal distribution, none of these determinations will result in as small a median error as one in which the sum of the squares of the errors of estimate is made a minimum. The constants of distributions which are widely divergent from the normal, so determined that the standard error of estimate is a minimum, are undoubtedly very excellent determinations, but it is no longer possible to say that constants so calculated have smaller median errors than would
others derived upon a different principle. In all of the follow- 
ing treatment of simple and multiple correlation, the principle 
of least squares is involved and the standard errors are minimal, 
and because of this fact the determinations are called “best fit”
determinations. They are “best” if the principle of least
squares is the proper principle but they may not be so if some 
other principle is more sound, though in all cases we certainly 
can describe the least square as a highly excellent determination. 
Referring to Chart XX, if the slope of the line drawn which is the regression of “$y$ upon $x$,” or the reversion of $y$ toward $x$, is $b_{21}$ (the numerical value of $b_{21}$ is equal to $\tan\phi$), then having given a value $x$, the best estimate of the corresponding $y$ value is $\bar{y}$, where $\bar{y} = b_{21}x$. In general $\bar{y}$ will not be identical with the actual or experimentally obtained value of $y$, so that $(y - \bar{y})$ indicates an error of estimate. The standard error of estimate $\sigma_{2.1}$ is given by the equation,

$$\sigma_{2.1}^2 = \frac{\sum (y - \bar{y})^2}{N}$$

The regression line which makes this magnitude a minimum is 
the regression line sought. Yule (1912) derives it without the 
use of calculus, but the calculus derivation is so much more 
simple that it is here given. See also in this connection 
problem 6 at end of this chapter. 


$$\sigma_{2.1}^2 = \frac{\sum (y - b_{21}x)^2}{N} = \frac{\sum y^2 - 2b_{21}\sum xy + b_{21}^2\sum x^2}{N}$$

$$\frac{d\,\sigma_{2.1}^2}{d\,b_{21}} = \frac{-2\sum xy + 2b_{21}\sum x^2}{N} = 0$$

$$b_{21} = \frac{\sum xy}{\sum x^2} = \frac{\sum xy}{N\sigma_1^2}$$
(Regression coefficient of variable 2 upon variable 1, or the regression of the dependent variable, 2, upon the independent variable 1) [89]


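This is the desired value of the regression coefficient. If standard measures are used the regression equation,

$$\bar{y} = \left(\frac{\sum xy}{\sum x^2}\right)x$$

becomes

$$\frac{\bar{y}}{\sigma_2} = \left(\frac{\sum xy}{N\sigma_1\sigma_2}\right)\frac{x}{\sigma_1}$$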




and the coefficient $\sum xy/(N\sigma_1\sigma_2)$ is the coefficient of correlation, $r$, or the measure of mutual implication, for a derivation similar to the preceding and involving the other regression line gives

$$b_{12} = \frac{\sum xy}{\sum y^2} = \frac{\sum xy}{N\sigma_2^2}$$
(Regression of variable 1 upon variable 2) [89]

so that

$$\frac{\bar{x}}{\sigma_1} = \left(\frac{\sum xy}{N\sigma_1\sigma_2}\right)\frac{y}{\sigma_2}$$

Thus the coefficient of correlation is given by

$$r = \frac{\sum xy}{\sqrt{\sum x^2\,\sum y^2}} = \frac{\sum xy}{N\sigma_1\sigma_2} = \sqrt{b_{12}b_{21}}$$
(Pearson product moment coefficient of correlation) [90]

and the regression equation may be written,

$$\frac{\bar{x}}{\sigma_1} = r\,\frac{y}{\sigma_2}$$
(Fundamental form of regression equation between two variables) [91]

The other regression is

$$\frac{\bar{y}}{\sigma_2} = r\,\frac{x}{\sigma_1}$$
[91]

Formula [91] may be written

$$\bar{x} = r\,\frac{\sigma_1}{\sigma_2}\,y \quad\text{and}\quad \bar{y} = r\,\frac{\sigma_2}{\sigma_1}\,x$$
[91 a]

or as

$$\bar{x} = b_{12}\,y \quad\text{and}\quad \bar{y} = b_{21}\,x$$
[91 b]

It is to be especially noted that whereas $r_{12}$ always equals $r_{21}$, the regression coefficient $b_{12}$ equals $b_{21}$ only in case the two variables have equal standard deviations.
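Formulas [89] and [90] translate directly into a few lines of code. The sketch below is illustrative only: the function name and the five paired measures are hypothetical, chosen so that the second variable has twice the spread of the first.

```python
import math

def regression_constants(xs, ys):
    """Return (b21, b12, r) from paired gross scores, per formulas [89] and [90]."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Deviations from the respective means (the x and y of the text).
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    sum_xy = sum(a * b for a, b in zip(dx, dy))
    sum_xx = sum(a * a for a in dx)
    sum_yy = sum(b * b for b in dy)
    b21 = sum_xy / sum_xx                      # regression of variable 2 upon 1
    b12 = sum_xy / sum_yy                      # regression of variable 1 upon 2
    r = sum_xy / math.sqrt(sum_xx * sum_yy)    # product moment coefficient
    return b21, b12, r

# Hypothetical data: the y's are spread twice as widely as the x's.
b21, b12, r = regression_constants([1, 2, 3, 4, 5], [4, 2, 8, 6, 10])
```

With these data the two regression coefficients differ (1.6 and .4) while the single value of $r$ is .8, and the product $b_{12}b_{21}$ equals $r^2$.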


Section 45. THE DETAILED STEPS IN THE CALCULATION OF
CORRELATION AND REGRESSION CONSTANTS


The steps necessary to the calculation of $\sum x^2$, $\sum y^2$ and $\sum xy$
are shown below and to the right of the diagram. (Chart XX.)
The origin taken is 68¼ inches, but as shown by the sum of the
$f\eta$ row (−20) and the sum of the $f\xi$ column (38) the exact
means are slightly different. We will calculate the correlation 
and regression coefficients without correcting for these slight 
discrepancies. They are taken into account in the calculation 
at the close of this section. To avoid working with fractions, 




deviations from the means have been expressed in terms of one-half inch intervals. Thus a $y$-value of −9 means 9 one-half inches below the mean. In terms of these units we have,

$$\sum \eta^2 = 6320, \qquad \sum \xi^2 = 2740, \qquad \sum \xi\eta = 1618$$

This last summation has been calculated in two ways so as to provide a check upon the arithmetical accuracy of the work. The first entry in the $Sf\xi$ row is −17. This is the sum of the products of the frequencies by the $\xi$'s for the single array for which $\eta = -9$. The notation $Sf\xi$ is used to designate a summation for an array, whereas $\sum f\xi$, or more simply $\sum\xi$, is the summation for the entire table. Similarly for $Sf\xi\eta$ and $\sum\xi\eta$. We have,


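$$r = \frac{1618}{\sqrt{(2740)(6320)}} = .3888$$

$$b_{21} = \frac{1618}{2740} = .5905 \quad \text{(Slope of regression line drawn)}$$

$$b_{12} = \frac{1618}{6320} = .2560 \quad \text{(Slope of other regression line)}$$

$$\sigma_1 = \sqrt{\frac{2740}{324}} = 2.908 \ \text{(in one-half inch intervals)} = 1.454 \ \text{(in inch intervals)}$$

$$\sigma_2 = \sqrt{\frac{6320}{324}} = 4.417 \ \text{(in one-half inch intervals)} = 2.208 \ \text{(in inch intervals)}$$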


The regression of height of offspring upon mid-parent, in inch units, and measured from an origin of 68¼ inches, is,

$$\frac{\bar{\eta}}{2.208} = .3888\,\frac{\xi}{1.454}$$

or,

$$\bar{\eta} = .5905\,\xi$$
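The arithmetic of this section can be retraced from the three summations and the population of 324 used later in the section. This is a minimal sketch; the variable names are illustrative, and the sums are taken about the arbitrary origin without correction, as in the text.

```python
import math

N = 324              # number of cases in Galton's table
sum_xi2 = 2740       # sum of squared mid-parent deviations (half-inch units)
sum_eta2 = 6320      # sum of squared offspring deviations (half-inch units)
sum_xi_eta = 1618    # sum of products of the paired deviations

r = sum_xi_eta / math.sqrt(sum_xi2 * sum_eta2)
b21 = sum_xi_eta / sum_xi2       # slope of offspring upon mid-parent
b12 = sum_xi_eta / sum_eta2      # slope of mid-parent upon offspring
sigma1_half = math.sqrt(sum_xi2 / N)    # in one-half inch intervals
sigma2_half = math.sqrt(sum_eta2 / N)   # in one-half inch intervals
sigma1_inch = sigma1_half / 2
sigma2_inch = sigma2_half / 2
```

Rounded to the places carried in the text these give .3888, .5905, .2560, 2.908 (1.454 inches) and 4.417 (2.208 inches).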


Having the equation in this fundamental form it is but a step to express it in terms of gross scores. Letting $M'$ = the arbitrary origin or approximate mean, we have

$$\xi = X - M'_1, \qquad \eta = Y - M'_2$$

Accordingly,

$$\bar{\eta} = b_{21}\,\xi$$

may be written

$$\bar{Y} - M'_2 = b_{21}(X - M'_1)$$

or

$$\bar{Y} = b_{21}X + (M'_2 - b_{21}M'_1)$$

$$\bar{X} = b_{12}Y + (M'_1 - b_{12}M'_2)$$

To illustrate the use of this equation let us estimate the 

most probable height of male offspring if the mid-parent 
height is 64 inches. 


$$\bar{Y} = .5905\,X + [68.25 - (.5905)(68.25)] = .5905\,X + 27.95$$


Solving, when X = 64, gives, Y = 65.74, as the most probable 
height of offspring, or the mean height of many such offspring. 
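The passage from deviation form to gross scores is a one-line computation. The helper below is a hypothetical illustration (its name and signature are not from the text); it uses the uncorrected slope and the arbitrary origin of 68.25 inches for both variables.

```python
def gross_score_line(b21, origin_1, origin_2):
    """Slope and intercept of the estimate of Y from X in gross scores.

    Implements Y_bar = b21 * X + (M'_2 - b21 * M'_1).
    """
    return b21, origin_2 - b21 * origin_1

slope, intercept = gross_score_line(0.5905, 68.25, 68.25)
predicted_height = slope * 64 + intercept   # mid-parent height of 64 inches
```

The intercept comes to 27.95 and the estimate for a 64-inch mid-parent to 65.74 inches, as in the text.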

The calculation of the constants involved in the regression 
equation as shown assumes that deviations are from the means 
of the two distributions. In case origins other than means 
are used corrections may be applied to secure the product 
moment and standard deviations from the means. The cor- 
rections for the standard deviations have already been given, 
formula [22]. 

Let $\Delta_1 = M'_1 - M_1$, the distance from the arbitrary origin to the mean of the $X$'s, and let $\Delta_2 = M'_2 - M_2$. Then,

$$\sigma_1^2 = \frac{\sum x^2}{N} = \frac{\sum \xi^2}{N} - \Delta_1^2, \qquad \sigma_2^2 = \frac{\sum y^2}{N} = \frac{\sum \eta^2}{N} - \Delta_2^2$$

$$\sum xy = \sum(\xi + \Delta_1)(\eta + \Delta_2)$$
$$= \sum \eta(\xi + \Delta_1) + \Delta_2\sum(\xi + \Delta_1)$$
$$= \sum \xi\eta + \Delta_1\sum\eta + \Delta_2\sum(\xi + \Delta_1)$$

and since $\sum(\xi + \Delta_1) = \sum x = 0$ and $\sum\eta = -N\Delta_2$, therefore,

$$\sum xy = \sum\xi\eta - N\Delta_1\Delta_2$$
(Formula for correction of product moment due to use of arbitrary origins) [92]

Accordingly $r$ may be calculated from any origins whatever by the formula

$$r = \frac{\sum\xi\eta - N\Delta_1\Delta_2}{\sqrt{\sum\xi^2 - N\Delta_1^2}\,\sqrt{\sum\eta^2 - N\Delta_2^2}}$$
(Pearson product moment coefficient of correlation calculated from any origin) [93]


When zero is, for each variable, the arbitrary origin the above formula is equivalent to

$$r = \frac{\sum XY - (\sum X)(\sum Y)/N}{\sqrt{\sum X^2 - (\sum X)^2/N}\,\sqrt{\sum Y^2 - (\sum Y)^2/N}}$$
(r calculated from zero as arbitrary origin) [94]

Another variation is

$$r = \frac{\sum XY - NM_1M_2}{\sqrt{\sum X^2 - NM_1^2}\,\sqrt{\sum Y^2 - NM_2^2}}$$
(r calculated from zero as arbitrary origin) [95]

Similarly,

$$b_{12} = \frac{\sum XY - NM_1M_2}{\sum Y^2 - NM_2^2}, \qquad b_{21} = \frac{\sum XY - NM_1M_2}{\sum X^2 - NM_1^2}$$
(Regression coefficients calculated from zero as arbitrary origin) [96]


Thus, for Galton's data, the correct values for the requisite constants are

$$r = \frac{1618 - (38)(-20)/324}{\sqrt{2740 - (38)^2/324}\,\sqrt{6320 - (-20)^2/324}} = .3897$$

$$b_{21} = .5923$$
$$M_1 = 68.25 + 38/324 = 68.37$$
$$M_2 = 68.25 - 20/324 = 68.19$$
$$\sigma_1 = 2.906$$
$$\sigma_2 = 4.416$$

Thus the corrected regression equation from the actual means as origins is

$$\bar{y} = .5923\,x$$

which differs but slightly from that obtained neglecting $\Delta_1$ and $\Delta_2$,

$$\bar{\eta} = .5905\,\xi$$

and the corrected regression equation from zero inches as origins is

$$\bar{Y} = .5923\,X + 27.69$$

which in turn differs but slightly from that obtained neglecting $\Delta_1$ and $\Delta_2$.
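The correction for arbitrary origins, in the form of formula [94] applied to deviations from the assumed origin, may be sketched as follows. The function name and the dictionary keys are illustrative assumptions; the sums are those of Galton's table in one-half inch units.

```python
import math

def corrected_constants(n, sum_xi, sum_eta, sum_xi2, sum_eta2, sum_xi_eta):
    """Correlation constants from sums taken about arbitrary origins.

    Subtracting (sum)(sum)/n from each raw sum is equivalent to the
    N * Delta_1 * Delta_2 corrections of formulas [92] and [93].
    """
    sxy = sum_xi_eta - sum_xi * sum_eta / n
    sxx = sum_xi2 - sum_xi ** 2 / n
    syy = sum_eta2 - sum_eta ** 2 / n
    return {
        "r": sxy / math.sqrt(sxx * syy),
        "b21": sxy / sxx,
        "b12": sxy / syy,
        "sigma1": math.sqrt(sxx / n),   # in the units of the deviations
        "sigma2": math.sqrt(syy / n),
    }

galton = corrected_constants(324, 38, -20, 2740, 6320, 1618)
```

The results agree with the corrected values quoted above (r = .3897, b21 = .5923, standard deviations 2.906 and 4.416 in one-half inch units).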


Section 46. THE ERROR INVOLVED IN CERTAIN
APPROXIMATIONS


It is desirable to know how large an error in the means may safely be neglected. We have, letting $s_1^2 = (\sum\xi^2)/N$ and $s_2^2 = (\sum\eta^2)/N$,

$$r = \frac{\sum\xi\eta - N\Delta_1\Delta_2}{N\sqrt{s_1^2 - \Delta_1^2}\,\sqrt{s_2^2 - \Delta_2^2}}$$

and we wish to ascertain how greatly this differs from the approximate value,

$$r' = \frac{\sum\xi\eta}{Ns_1s_2}$$

Setting the expressions $1/\sqrt{s^2 - \Delta^2}$ equal to $1\big/\left[s\sqrt{1 - (\Delta/s)^2}\right]$ and expanding the radical by the binomial theorem, discarding powers in $\Delta/s$ greater than the second as being negligible in comparison with the second powers, gives, after certain simple reductions,

$$\epsilon = r - r' = \frac{r'}{2}\left[\left(\frac{\Delta_1}{s_1}\right)^2 + \left(\frac{\Delta_2}{s_2}\right)^2\right] - \frac{\Delta_1\Delta_2}{s_1s_2}$$
(Showing error in r from use of approximate means) [97]

in which $\epsilon$ is the error introduced in case $r'$ is taken as the value of $r$. Note that if $r'$ is positive, less error is introduced if $\Delta_1$ and $\Delta_2$ have the same sign than if they are of opposite sign. Let us assume the two magnitudes $(\Delta/s)$ are equal. Then,

$$\epsilon = (r' - 1)\left(\frac{\Delta}{s}\right)^2$$

In this case $\epsilon$ is negative, i.e., if approximate means which are in error in the same sense are used, the obtained correlation, $r'$, is larger than the correct value, $r$. We may solve the preceding equation for $\Delta/s$ for assigned values of $r'$ and $\epsilon$. The following tables give certain solutions:


IF ERRORS IN MEANS ARE EQUAL AND OF THE SAME SIGN

   ε       r'     Δ/s     Δ = approximately
 −.001     .0     .032    1/158 of range
 −.005     .0     .071    1/71   "
 −.010     .0     .100    1/50   "
 −.001     .7     .058    1/87   "
 −.005     .7     .129    1/39   "
 −.010     .7     .183    1/27   "
 −.001     .9     .100    1/50   "
 −.005     .9     .224    1/22   "
 −.010     .9     .316    1/16   "

IF ERRORS IN MEANS ARE EQUAL AND OF OPPOSITE SIGN

   ε       r'     Δ/s     Δ = approximately
  .001     .0     .032    1/158 of range
  .005     .0     .071    1/71   "
  .010     .0     .100    1/50   "
  .001     .7     .024    1/206  "
  .005     .7     .054    1/92   "
  .010     .7     .077    1/65   "
  .001     .9     .023    1/218  "
  .005     .9     .051    1/103  "
  .010     .9     .073    1/69   "
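The solutions tabulated can be reproduced from formula [97]. The sketch below is illustrative and rests on two assumptions not stated at this point in the text: that the two ratios Δ/s are equal in magnitude, and that the range is taken as about five standard deviations (the value implied by the entry "1/158 of range" for Δ/s = .032). The function names are hypothetical.

```python
import math

def delta_over_s(eps, r_prime, same_sign=True):
    """Magnitude of Delta/s giving error eps = r - r' under formula [97].

    Same sign:     eps = (r' - 1) * (Delta/s)**2
    Opposite sign: eps = (r' + 1) * (Delta/s)**2
    """
    coefficient = (r_prime - 1.0) if same_sign else (r_prime + 1.0)
    ratio_squared = eps / coefficient
    if ratio_squared < 0:
        raise ValueError("eps has the wrong sign for this case")
    return math.sqrt(ratio_squared)

def range_denominator(ratio, range_in_sd=5.0):
    """Express Delta as 1/k of the range, assuming range = range_in_sd * s."""
    return range_in_sd / ratio

same = delta_over_s(-0.010, 0.7)                       # errors of the same sign
opposite = delta_over_s(0.010, 0.7, same_sign=False)   # errors of opposite sign
```

For a correlation of .7 and a permissible error of .01, the same-sign case allows Δ/s of about .183 (roughly 1/27 of the range), while the opposite-sign case allows only about .077.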


Since for $\Delta$'s of a given size, there is much greater error in the correlation coefficient if they are of opposite sign than if of the same sign, therefore in choosing arbitrary means, it is frequently desirable so to choose that $\Delta_1$ and $\Delta_2$ have the same sign. For example: suppose $\sigma_1 = \sigma_2 = 3$, $M_1 = 12.56$ and $M_2 = 9.30$, then better results will be obtained, if correction for arbitrary means is not made, by choosing 12.0 and 9.0 ($\Delta_1 = .56$ and $\Delta_2 = .30$) than by choosing 13.0 and 9.0 ($\Delta_1 = -.44$ and $\Delta_2 = .30$). For many investigations, an error of 1 per
cent is not material so that, as a practical procedure subject 
to refinement if low correlations are involved or if a 1 per cent 
error is serious, it is safe to forego correcting for arbitrary 
means if the error in each of the means is less than 1/27 of the 
range and if they are of the same sign. This requirement 
is more easily met than one imposing the condition that the 
standard deviation should not be in error by more than 1 per 
cent. As standard deviations are usually features of a distribu- 
tion which it is desirable to know, it seems better to forego 
correction for an arbitrary mean only in case the error introduced in the standard deviation is less than 1 per cent. We have

$$\sigma = \sqrt{s^2 - \Delta^2} = s - \frac{\Delta^2}{2s} + \text{higher powers in } \left(\frac{\Delta}{s}\right)$$

The error introduced by using $s$ in place of $\sigma$ is

$$s - \sigma = \frac{\Delta^2}{2s}$$
(Showing error in σ from use of approximate mean) [98]

and the proportionate error is

$$\frac{s - \sigma}{s} = \frac{\Delta^2}{2s^2}$$

If an error of 1 per cent is permissible, we may write

$$\frac{\Delta^2}{2s^2} = .01$$

or $\Delta$ is approximately 1/35 of the range. If there are 18 or more intervals in the range covered by the measures and if the arbitrary mean is chosen as the middle of the interval nearest the correct mean, then the error will be less than ½ the interval or less than 1/36 of the range, so that the error in the resulting standard deviation will be less than 1 per cent.
The correction just considered is on account of displacement of the mean. Sheppard's correction, formula [68a], is for grouping. If $\sigma$ equals the correct standard deviation and $S$ the standard deviation obtained from coarsely grouped data, Sheppard's correction gives

$$\sigma^2 = S^2 - \frac{1}{12}$$

$$\sigma = S - \frac{1}{24S} + \text{higher powers in } \left(\frac{1}{12S^2}\right)$$

$$\frac{S - \sigma}{S} = \frac{1}{24S^2}$$
(Showing error in σ due to grouping) [99]

and if this equals .01, we have

$$S = 2.041$$

If the standard deviation is 2 or a trifle greater, the range is 
in the neighborhood of 10 or 12, so that if we have as many 
as 12 steps, the error of the standard deviation due to grouping 
is less than 1 per cent. The most exacting condition is there- 
fore the one preceding this. 
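The two 1 per cent thresholds just derived can be recomputed from formulas [98] and [99]. This is a sketch only: the function names are hypothetical, and the conversion to a fraction of the range assumes a range of about five standard deviations, which is the value the figure "1/35 of the range" implies.

```python
import math

def max_mean_error_ratio(tolerance):
    """Largest Delta/s for which Delta**2 / (2 s**2) stays within tolerance ([98])."""
    return math.sqrt(2 * tolerance)

def min_grouped_sd(tolerance):
    """Smallest S, in class-interval units, with 1 / (24 S**2) within tolerance ([99])."""
    return math.sqrt(1 / (24 * tolerance))

ratio = max_mean_error_ratio(0.01)     # about .1414
range_fraction = 5.0 / ratio           # Delta is about 1/35 of an assumed 5-sigma range
sd_needed = min_grouped_sd(0.01)       # about 2.041 intervals
```

The first condition (Δ under about 1/35 of the range) is the more exacting of the two, as the text observes.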

Accordingly, if there are 12 or more intervals in the ranges 
of both variables, and if the origins are so taken, by resorting 
to ½ or ¼ steps if necessary, as not to differ from the correct
means by more than 1/25 of the range if the correlation is 
above .70, or 1/50 if near .oo; and if the origins taken lie either 
both above or below the correct means; the error introduced 
in either the standard deviations or the coefficient of correlation 
by not correcting for grouping or for approximate means, is 
less than 1 per cent. In case intervals are of necessity so 
broad that a material error in correlation results, the raw 
correlation coefficient requires a correction for broad categories. 


Section 47. THE BEARING OF BROAD CATEGORIES UPON
CORRELATION


Writing $p_{11}$ for the product moment $\sum xy/N$, as in Section 48, we have

$$r_{12} = \frac{p_{11}}{\sigma_1\sigma_2}$$

Ordinarily $\sigma_1$ and $\sigma_2$ will be taken as the standard deviations of the class indexes, but more accurate values are obtained by first applying Sheppard's corrections, formula [68 a]. Thus if $h$ and $k$ are the group intervals, $s_1$ and $s_2$ the standard deviations before applying Sheppard's corrections, we have,

$$\sigma_1^2 = s_1^2 - \frac{h^2}{12}, \qquad \sigma_2^2 = s_2^2 - \frac{k^2}{12}$$

To a first approximation there is no correction for grouping to be made to the product moment, $p_{11}$, so that we have

$$r_{12} = \frac{p_{11}}{\sqrt{s_1^2 - \frac{h^2}{12}}\,\sqrt{s_2^2 - \frac{k^2}{12}}} = \frac{\sum xy}{N\sqrt{s_1^2 - \frac{h^2}{12}}\,\sqrt{s_2^2 - \frac{k^2}{12}}}$$
(Coefficient of correlation after applying Sheppard's corrections) [100]


If the grouping is very coarse and irregular we may assume 
a normal distribution and determine the mean of each class; 
calculate the correlation, using these mean class values as our 
variates, and correct for grouping. The correction for grouping 
is different from Sheppard’s because here our correction is on 
account of using mean-class-values in place of the continuous 
variate, whereas Sheppard’s correction is on account of using 
mid-class-values in place of the continuous variate. To point 
the distinction the following hypothetical problem involving 
trade ratings and general intelligence ratings is given. 


                      MENTAL ABILITY                PER CENT   q = PROPORTION   z = ORDINATE   x = MEAN
TRADE RATING     Dull   Average   Bright   TOTAL    IN EACH    ABOVE UPPER      AT UPPER       OF
                                                    CLASS      LIMIT OF CLASS   LIMIT          CLASS

Expert             1       4         5       10       10                                        1.755
                                                                   .10            .175498
Journeyman         4      10        16       30       30                                         .703
                                                                   .40            .386342
Apprentice         4      11         5       20       20                                         .000
                                                                   .60            .386342
Novice            11      25         4       40       40                                        −.966
                                                                  1.00            .000000
Total             20      50        30      100

z           .000000    .279962    .347693    .000000
y              −1.400      −.135      1.159


$$\sigma_x = \sqrt{80.40968/100} = .896714$$

$$\sigma_y = \sqrt{82.95276/100} = .910784$$

$$r_{yx} = \frac{28.57438}{100 \times .896714 \times .910784} = .34987$$




The symbols $q$, $z$ and $x$ have the meanings of Section 27. Formula [55] is used in determining $x$ from $q$ and $z$. Treating $x$ and $y$ as the values of the deviates from the means, the correlation coefficient is, by the usual process, found to equal .34987. This value, however, suffers from a large grouping error. We cannot apply Sheppard's corrections because we do not have equal class intervals and because we have not dealt with class indexes, but class means. Whereas $s$, the standard deviation of class indexes, is greater than $\sigma$, the standard deviation of the continuous variates; $s'$, the standard deviation of class means, is less than $\sigma$. If class intervals are equal and equal to the unit used, we have,

$$\sigma^2 = s^2 - \frac{1}{12}$$
(Sheppard's correction) [68 a]

also

$$\sigma^2 = s'^2 + \frac{1}{12}$$
(Pearson's correction to the standard deviation of class means) [101]


This second formula, as well as subsequent ones in this section, 
was derived by Pearson (1913 meas.). We thus see that an 
entirely different correction is needed. This last correction is 
not of general utility, as the problems in which we use class 
means instead of class indexes are usually such that we do not 
know that the class intervals are equal. We may, however, 
determine the correction by aid of the correlation between the 
variate and the class means of the classes into which the 
variates are placed. 

Let $x$ be the value of the continuous variate, and $\mathbf{x}$ (bold) the value of the means of the classes into which the $x$ measures are placed. Then the regression of the $x$'s upon the $\mathbf{x}$'s is

$$\bar{x} = r_{x\mathbf{x}}\,\frac{\sigma_x}{\sigma_{\mathbf{x}}}\,\mathbf{x}$$

but $\bar{x}$ is the mean of all the variates in the class of which $\mathbf{x}$ is the class mean, or simply $\bar{x} = \mathbf{x}$. Substituting in the preceding equation we have

$$r_{x\mathbf{x}} = \frac{\sigma_{\mathbf{x}}}{\sigma_x}$$
(Correlation between a variate and the means of the classes in which it is recorded) [102]

The standard deviation of the class means $\sigma_{\mathbf{x}}$ is obtained by calculation, and $\sigma_x$ is known if the form of distribution is known. For the problem given $\sigma_{\mathbf{x}} = .896714$ and $\sigma_x = 1.0$ since a normal distribution of standard deviation equal to 1.0 was assumed, so that $r_{x\mathbf{x}} = .8967/1.0 = .8967$, and if $y$ stands for the continuous variate in the case of the second variable, then $r_{y\mathbf{y}} = .9108/1.0 = .9108$.

Continuing we may find the correlation between two continuous variates when each is recorded in broad categories. The following simple derivation depends upon principles of partial correlation discussed in Chapter XI. The reader should therefore be familiar with that chapter before attempting to follow this proof. The symbol $r_{\mathbf{xy}.xy}$ stands for the correlation between class means for constant values of the graduated variates $x$ and $y$. Clearly when $x$ and $y$ are constant the corresponding class means $\mathbf{x}$ and $\mathbf{y}$ do not vary, so that $r_{\mathbf{xy}.xy} = 0$. This partial correlation coefficient, $r_{\mathbf{xy}.xy}$, is equal to a numerator determinant divided by the square root of the product of two others. The divisor is easily shown to be intrinsically positive so that the quotient becomes zero with the dividend. Accordingly we have

$$\begin{vmatrix} r_{\mathbf{xy}} & r_{x\mathbf{y}} & r_{y\mathbf{y}} \\ r_{x\mathbf{x}} & 1 & r_{xy} \\ r_{y\mathbf{x}} & r_{xy} & 1 \end{vmatrix} = 0$$

in which $r_{xy}$ is the corrected value sought, and $r_{\mathbf{xy}}$ is the value calculated, using the means of the broad categories. It has just been shown that $r_{x\mathbf{x}}$ is equal to $\sigma_{\mathbf{x}}/\sigma_x$, and $r_{y\mathbf{y}}$ equal to $\sigma_{\mathbf{y}}/\sigma_y$. We need to know $r_{x\mathbf{y}}$ and $r_{y\mathbf{x}}$. The partial correlation $r_{x\mathbf{y}.y}$ is that between the variate $x$ for a given value of the second variate $y$, and the class mean $\mathbf{y}$ for a given value of the second variate $y$. The class mean for a given value of the variate is invariable, so that $\mathbf{y}$ for constant $y$ is constant and accordingly $r_{x\mathbf{y}.y} = 0$. This partial coefficient can be zero only when the numerator of the quotient which is equal to it is zero; that is

$$r_{x\mathbf{y}} - r_{xy}\,r_{y\mathbf{y}} = 0$$

or

$$r_{x\mathbf{y}} = r_{xy}\,r_{y\mathbf{y}}$$

Similarly

$$r_{y\mathbf{x}} = r_{xy}\,r_{x\mathbf{x}}$$

Substituting in the determinant and solving for $r_{xy}$ gives

$$r_{xy} = \frac{r_{\mathbf{xy}}}{r_{x\mathbf{x}}\,r_{y\mathbf{y}}} = \frac{r_{\mathbf{xy}}\,\sigma_x\sigma_y}{\sigma_{\mathbf{x}}\sigma_{\mathbf{y}}}$$
(Giving the correction to r on account of use of class means) [103]

In case a normal distribution of standard deviation 1 is assumed to fit the distributions of the two variables, and the means of categories calculated upon this assumption, $\sigma_x$ and $\sigma_y$ each equal 1 so that we have

$$r_{xy} = \frac{r_{\mathbf{xy}}}{\sigma_{\mathbf{x}}\sigma_{\mathbf{y}}} = \frac{\sum \mathbf{xy}}{N\sigma_{\mathbf{x}}^2\sigma_{\mathbf{y}}^2}$$
(Correction to r on account of use of class means, upon assumption of a unit normal distribution) [104]

The correction here derived for broad categories is equally 
serviceable when determining correlation ratios or contingency 
coefficients as described in Section 68. 

Note that there are two corrections; one, Sheppard’s, to be 
applied on account of broad equal intervals when class indexes 
are taken as the variates; and the second, the one here given, 
to be applied when the class means of broad equal or unequal 
intervals are taken as the variates. No correction is as yet 
worked out for application when class indexes are used and 
the intervals are broad and unequal, though in such case good 
results may be expected by empirically setting h in Sheppard’s 
formula [68 a] equal to the mean of the several intervals 
involved. 

We may return to the numerical problem and apply the correction to obtain the correlation corrected for broad categories between trade ratings and estimates of intelligence. It yields

$$r_{xy} = \frac{.34987}{.896714 \times .910784} = .4284$$

In this calculation it has been assumed that the class mean $\mathbf{x}$ is the same for the first cell (expert-dull), the second cell (expert-average), and the third cell (expert-bright), and similarly throughout the rest of the table. This is only approximately true, and in case the categories are very broad and the correlation high it is far from true. The method should not be used with a fourfold table and it is of doubtful validity for the table given. It may be applied with good results if no class contains more than 25 or 30 per cent of the cases and if the correlation is not greater than .9.
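The whole of the broad-category procedure for the trade-rating problem can be retraced in code. The sketch below is an illustration under the text's assumption of a unit normal distribution for each variable; it uses Python's standard `statistics.NormalDist` for the ordinates in place of the printed tables of Section 27, and the function and variable names are hypothetical. Because the class means are not rounded to three places as in the text, the results agree with the printed ones only to about three decimals.

```python
import math
from statistics import NormalDist

def class_means(proportions):
    """Means of successive classes (lowest first) cut from a unit normal curve.

    The mean of a class is (ordinate at lower limit - ordinate at upper
    limit) divided by the proportion of cases in the class.
    """
    unit_normal = NormalDist()
    means, cumulative, lower_ordinate = [], 0.0, 0.0
    for index, p in enumerate(proportions):
        cumulative += p
        if index == len(proportions) - 1:
            upper_ordinate = 0.0     # the upper limit is at infinity
        else:
            upper_ordinate = unit_normal.pdf(unit_normal.inv_cdf(cumulative))
        means.append((lower_ordinate - upper_ordinate) / p)
        lower_ordinate = upper_ordinate
    return means

# Rows: Novice, Apprentice, Journeyman, Expert. Columns: Dull, Average, Bright.
table = [[11, 25, 4], [4, 11, 5], [4, 10, 16], [1, 4, 5]]
n = sum(sum(row) for row in table)
row_p = [sum(row) / n for row in table]
col_p = [sum(row[j] for row in table) / n for j in range(3)]
trade = class_means(row_p)
mental = class_means(col_p)

# Class means already average to zero, so these are standard deviations.
sd_trade = math.sqrt(sum(p * m * m for p, m in zip(row_p, trade)))
sd_mental = math.sqrt(sum(p * m * m for p, m in zip(col_p, mental)))
product_sum = sum(table[i][j] * trade[i] * mental[j]
                  for i in range(4) for j in range(3))
raw_r = product_sum / (n * sd_trade * sd_mental)
corrected_r = raw_r / (sd_trade * sd_mental)   # formula [104]
```

The raw coefficient comes to about .350 and the corrected one to about .428, in agreement with the .34987 and .4284 of the text.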




Section 48. PROPERTIES OF CORRELATION SURFACES 


With the scatter diagram of Chart XX before us, the mean- 
ings of certain terms will be readily grasped. If the standard 
deviations of the successive $x$ arrays are equal, the distribution
is homoscedastic in the x variable and if, in addition, the 
standard deviations of the y arrays are equal, the correlation 
surface is homoscedastic in both senses. If the slope of the 
distribution in an array a given distance above the mean of 
the array, is equal to the slope the same distance below, and if 
this is true of all arrays, the total distribution is called homo- 
clitic; thus, a distribution composed of symmetrical arrays is 
homoclitic. If means of successive arrays lie in a straight 
line, the regression is rectilinear or, by some writers, is termed 
linear. In case a regression table is homoscedastic, homoclitic, 
and has two rectilinear regression lines, the most probable 
value of one variable when estimated from a knowledge of the 
other, is that given by the regression equation. The regression 
determination in the case of distributions showing moderate 
divergence from these three conditions will still be very nearly 
the most probable. Scatter diagrams showing extreme di- 
vergence should be treated by some other method. Lack of 
substantial rectilinearity in regression is the most readily de- 
tected feature of a correlation surface which vitiates the use 
of the product moment coefficient of correlation. For most 
problems, the establishment of rectilinearity is sufficient to 
completely justify the use of the Pearson product moment 
coefficient of correlation. Note that this is a much easier 
requirement to meet than that the correlation surface be 
normal, that is, capable of accurate representation by means 
of equation [88]. Accurate correlation results may regularly
be expected from distributions showing rectilinear regression 
lines, but otherwise widely divergent from the normal cor- 
relation surface. Due to the fact that Pearson’s early de- 
velopment of the product moment coefficient of correlation 
was based upon the assumption of a normal correlation sur- 
face, it has frequently been assumed that such a surface is 


prerequisite to the sound use of the coefficient, but this is not 
at all true. 




Having the means at hand of estimating a second variable, 
knowing a first, it is desirable to ascertain the probable error 
of such determinations. Obviously if arrays are homoscedastic, 
the standard deviation of any array is the standard error of 
any single estimate. 


$$\sigma_{2.1} = \sigma_2\sqrt{1 - r^2} = \sigma_2 k$$
$$\sigma_{1.2} = \sigma_1\sqrt{1 - r^2} = \sigma_1 k$$
(The standard deviation of an array or the standard error of estimate of a second variable, knowing the first) [86]

The quantity $k$ of the above equations is defined in the next paragraph.

With the data of Chart XX in hand, $\sigma_{2.1} = 2.208\sqrt{1 - (.3888)^2} = 2.034$. That is to say, that if the correlation between height of mid-parent and offspring is .3888 and if the standard deviation of heights of offspring is 2.208 inches, then the standard error of estimate of a child's height, determined from the mid-parent height, is 2.034 inches. A guess that the height of every offspring is 68¼ inches would have a standard error of 2.208 inches so that the increased accuracy of estimate due to utilizing the correlation of .3888 between mid-parent and offspring reduces the standard error of estimate to 2.034 inches, or about 8 per cent reduction. It is thus seen that no very great improvement in estimate results from a correlation no higher than .3888. The proportionate reduction is given by the factor $\sqrt{1 - r^2}$. This factor measures the lack of relationship between two variables just as $r$ measures presence of relationship. I have elsewhere (Kelley, 1919) described certain of its properties and have termed it a coefficient of alienation. The coefficient of alienation may be interpreted in a positive sense for if a criterion, $x_0$, correlates to the extent $r$ with a given measure, $x_1$, and if there exists some other measure, $x_2$, independent of $x_1$, but which together with it completely determines $x_0$, then the correlation between $x_0$ and $x_2$ is $k$. Its immediate determination, having any value of $r$, is given by

$$k = \sqrt{1 - r^2}$$
(Coefficient of alienation) [86 a]


and the calculation may readily be made by the aid of the small 
alignment chart given in the appendix or the large chart which 
is a supplement to (Kelley, 1921). To secure an idea of the 




improvement of estimate with increase in correlation, the 
following table is given: 


COEFFICIENT OF   COEFFICIENT OF   COEFFICIENT OF   COEFFICIENT OF
CORRELATION      ALIENATION       CORRELATION      ALIENATION
     r                k                r                k
    .00            1.0000             .80             .6000
    .10             .9950             .8660           .5000
    .30             .9539             .90             .4359
    .50             .8660             .95             .3122
    .60             .8000             .98             .1990
    .70             .7141             .99             .1411
    .7071           .7071            1.00             .0000

Notice that a correlation of .866 is necessary before the error 
of estimate has been reduced a half, and that even with a 
correlation of .99, the error of estimate is still 1/7 as great as a
sheer random guess. It should be obvious from these facts 
that if individual estimates are to be made, it is necessary 
that very high correlation be present in order to secure even 
moderately reliable results. 
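The table of alienation coefficients follows at once from formula [86 a]. A minimal sketch, with a hypothetical function name, which also retraces the standard error of estimate computed for Galton's data:

```python
import math

def alienation(r):
    """Coefficient of alienation k = sqrt(1 - r**2), formula [86 a]."""
    return math.sqrt(1 - r * r)

# Reproduce the rows of the table, rounded to four places.
table_rows = [(r, round(alienation(r), 4))
              for r in (0.00, 0.10, 0.30, 0.50, 0.60, 0.70, 0.7071,
                        0.80, 0.8660, 0.90, 0.95, 0.98, 0.99, 1.00)]

# Standard error of estimate of offspring height, formula [86].
galton_error = 2.208 * alienation(0.3888)
```

A correlation of .866 is needed to halve the error of estimate, and `galton_error` comes to about 2.034 inches, a reduction of only some 8 per cent from 2.208.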

It is sometimes convenient to work with probable errors 
instead of standard deviations, in which case we have 


$$\text{P.E.}_{1.2} = \text{P.E.}_{1}\,k$$
(Probable error of estimate of the second variable, knowing the first) [86 b]


The calculation of the formula for the probable error of the 
coefficient of correlation is involved and has several times 
been given (Sheppard, 1898), (Pearson, 1913, freq.), and is 
not repeated here, but the formulas upon which it is based 
have general value. Not only the probable error of the coeffi- 
cient of correlation, but many other probable errors as well, 
depend upon certain higher product moments and upon the 
correlation between product moments. The notation and 
meaning of product moments may be made clear by certain 
illustrations. 

DXY 
Pu = ae 


and is a second order product moment, 


= XY? 
p= 24h 


MEASURES OF RELATIONSHIP L735 


and is a third order product moment,

$$p_{34} = \frac{\sum X^{3}Y^{4}}{N}$$

and is a seventh order product moment, etc., and in general,

$$p_{qq'} = \frac{\sum X^{q}Y^{q'}}{N}$$

gives a product moment of the (q + q') order around some fixed point. Following Pearson we would use the symbol $\bar{p}_{qq'}$ to represent the same product moment around the mean as origin, but as moments around the mean are the only ones here concerning us, we will drop the superior bar and use $p_{qq'}$ in place of $\bar{p}_{qq'}$. The meaning of the notation may be illustrated by a few examples involving familiar constants.


$$p_{10} = \frac{\sum x}{N} = 0$$

$$p_{01} = \frac{\sum y}{N} = 0$$

$$p_{20} = \frac{\sum x^{2}}{N} = \sigma_1^{2}$$

$$p_{11} = \frac{\sum xy}{N} = r_{12}\sigma_1\sigma_2.$$
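
The notation may also be illustrated numerically. The following is a minimal sketch in Python; the function name `product_moment` and the five pairs of scores are hypothetical and are not data from the text.

```python
import math


def product_moment(xs, ys, q, q_prime):
    """Product moment p_{q q'} of order (q + q'), taken around the means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) ** q * (y - mean_y) ** q_prime
               for x, y in zip(xs, ys)) / n


# Hypothetical scores, for illustration only.
X = [1, 2, 3, 4, 5]
Y = [2, 1, 4, 3, 5]

p10 = product_moment(X, Y, 1, 0)   # first moment around the mean, zero
p20 = product_moment(X, Y, 2, 0)   # variance of X
p02 = product_moment(X, Y, 0, 2)   # variance of Y
p11 = product_moment(X, Y, 1, 1)   # equals r * sigma_1 * sigma_2
r = p11 / math.sqrt(p20 * p02)
```

With these scores p20 = p02 = 2, p11 = 1.6, and r = .8.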


Section 49. STANDARD DEVIATIONS AND CORRELATIONS OF 
VARIOUS CONSTANTS 


The standard error of any product moment is given by the equation (Pearson, 1913, freq.),

$$N\sigma^{2}_{p_{qq'}} = p_{2q,2q'} - p^{2}_{qq'} + q^{2}p_{20}\,p^{2}_{q-1,q'} + q'^{2}p_{02}\,p^{2}_{q,q'-1} + 2qq'\,p_{11}\,p_{q-1,q'}\,p_{q,q'-1} - 2q\,p_{q+1,q'}\,p_{q-1,q'} - 2q'\,p_{q,q'+1}\,p_{q,q'-1}$$
(Standard error of any product moment from the means) ... [105]

The correlation between any two product moments is given by

$$N\sigma_{p_{qq'}}\sigma_{p_{uu'}}\,r_{p_{qq'}p_{uu'}} = p_{q+u,\,q'+u'} - p_{qq'}p_{uu'} + qu\,p_{20}\,p_{q-1,q'}\,p_{u-1,u'} + q'u'\,p_{02}\,p_{q,q'-1}\,p_{u,u'-1} + qu'\,p_{11}\,p_{q-1,q'}\,p_{u,u'-1} + q'u\,p_{11}\,p_{q,q'-1}\,p_{u-1,u'} - u\,p_{q+1,q'}\,p_{u-1,u'} - u'\,p_{q,q'+1}\,p_{u,u'-1} - q\,p_{u+1,u'}\,p_{q-1,q'} - q'\,p_{u,u'+1}\,p_{q,q'-1}$$
(Correlation between any two product moments taken from the means) . [106]

These two equations provide the basic relationships which lead to the following special probable errors and correlations.


As will be noted the formulas greatly simplify if homoscedasti- 
city and rectilinearity are assumed, and simplify still further 
if normality of correlation surface is assumed. 

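Standard error of the mean,

$$\sigma_M = \frac{\sigma}{\sqrt{N}}$$

Standard error of the standard deviation,

$$\sigma_\sigma = \frac{\sigma}{\sqrt{2N}}$$ (Assuming a mesokurtic distribution) .. [32 a]

Standard error of the regression coefficient,

$$\sigma_{b_{12}} = \frac{\sigma_1}{\sigma_2}\sqrt{\frac{1 - r^{2}}{N}} = \frac{\sigma_1 k}{\sigma_2\sqrt{N}}$$ (Assuming homoclisy and rectilinearity) ........ [107]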
Standard error of the correlation coefficient,

$$\sigma^{2}_{r} = \frac{r^{2}}{N}\left(\frac{p_{22}}{p_{11}^{2}} + \frac{p_{40}}{4p_{20}^{2}} + \frac{p_{04}}{4p_{02}^{2}} + \frac{p_{22}}{2p_{20}p_{02}} - \frac{p_{31}}{p_{11}p_{20}} - \frac{p_{13}}{p_{11}p_{02}}\right)$$
(No assumptions except that [error/N] to second and higher powers are negligible in comparison with first powers) .... [108]


This complete formula was first given by Sheppard (1898) 


= (1-1 + Bs — 8) a) 


(Assuming rectilinearity of regression. This assumption carries with it the necessity of equal kurtosis, if arrays are homoscedastic) ........ [108 a]


Or 


This formula, as well as others in this section, is given by Pearson (1913, freq.). The constants, $\beta_2$ and $\beta'_2$, are the $\beta_2$'s for the two distributions.

$$\sigma_r = \frac{k^{2}}{\sqrt{N}}$$ (From preceding formula, assuming mesokurtosis in addition) ........ [108 b]

This standard error was first derived by Filon and Pearson (1898), upon the assumption of normality, but note that the formula is in fact more general than this. Also note that if r is high and the kurtosis small, the formula gives too small a value; and that if the correlation and kurtosis are high, the formula gives too large a value.


$$\sigma_r = \frac{1 - \rho^{2}}{\sqrt{N - 1}}\left(1 + \frac{11\rho^{2}}{4(N - 1)}\right)$$ (Standard error of r to a second approximation) ............. [108 c]




In the derivation of this formula, squares of the magnitudes [error/N] were kept and normality of correlation surface assumed. (Soper, 1913.) The magnitude ρ is the true correlation and for such small populations as this formula is intended it may lead to substantial error to use r, the obtained correlation, in its place. This is particularly true if r is very large. However, the use of r in place of the unknown value, ρ, if r < .95, and calculation of the standard error of r by the above formula in case N < 25, should give better results than formulas [108 a] or [108 b]. If formulas [108 a] or [108 b] are used for these small populations an improved result may be expected by multiplying the standard error given by them by


it Gee Sy ACM ban sosarn cece wou one 66 [108 d] 


As a practical matter, r determined from samples < 5 may be 
considered meaningless and nearly so if determined from 
samples < 7. 

Standard error of the constant term $(M_1 - b_{12}M_2)$ of the regression equation. Let $c = (M_1 - b_{12}M_2)$. Then,

$$\sigma_c = \sigma_{b_{12}}\sqrt{M_2^{2} + \sigma_2^{2}}$$ (Assuming homoclisy and rectilinearity) ........ [109]


Standard error of the estimated mean of an array, $\bar{y}_x$ (the mean y score of the x-array),

$$\sigma_{\bar{y}_x} = \frac{\sigma_2 k}{\sqrt{N}}\sqrt{1 + \frac{x^{2}}{\sigma_1^{2}}}$$ (Assuming homoclisy and rectilinearity) ........ [110]

Note the decrease in the accuracy of the means of the arrays 
as we go further and further from the mean of the total distri- 
bution. A further important consequence of this equation 
is that for certain situations it gives the standard error of the 
mean of a total population [see formula 111] since the estimated mean of the array for x = 0 is the mean of the total y-distribution.

$$\sigma_{M_2} = \frac{\sigma_2 k}{\sqrt{N}}$$ (Standard error of a second mean in case a first mean is known with zero error, and in case the correlation between the two series of measures is r) .... [111]


Certain correlations between the constants of a correlation surface are at times needed. Let $n_s$ = the frequency in row s; $n_{s'}$ in column s'; and $n_{ss'}$ in the compartment or cell given by the intersection of the s row and the s' column. Then,


Nss! 
on = M,, ( — =) dist Gatos bl oT bis MESO ES [112] 
Tn Try Thee My = — “see ODE Le OI LL io Nea Oo ok [113] 
, Ns Ns! 
on, on, nny = iss = Ne Sue gistin Teivel ia isi [ote erebstteveale to veiet ee iyien ris [1 14] 
NsN1s' 
on, Ons 1 nyt — — WN EE ar a nr a hers ica ter [115] 
n 
In, Tn. Money = nss" (: ~ a Pe MIN sae nce abo: [116] 
_ Mss’ (Correlation between the mean 
TMi Ons "Ming ~ and the frequency of a cell)... .. [117] 
$$r_{M_1M_2} = r_{12}$$ (Correlation between means) ..... [118]

$$r_{\sigma_1\sigma_2} = \frac{p_{22} - p_{20}p_{02}}{\sqrt{p_{40} - p_{20}^{2}}\,\sqrt{p_{04} - p_{02}^{2}}}$$ (Correlation between standard deviations) .. [119]

$$\frac{p_{22}}{p_{20}p_{02}} - 1 = r^{2}(\beta_2 - 1) = r^{2}(\beta'_2 - 1)$$ (Assumption that both distributions are homoscedastic and regressions rectilinear) ... [120]

Thus,

$$r_{\sigma_1\sigma_2} = r^{2}$$ (Assumption of rectilinearity, homoscedasticity and equal kurtosis) ........ [121]

$$p_{13} = r\,\sigma_1\sigma_2^{3}\,\beta'_2$$ (Assumption of rectilinearity) ............. [122]
r Sr = Bi (No assumptions) [123] 
ip Pits apart DhHONS) Sere sere 3 
If B: = 0, then Fig oO apis ib asa Us we pins Maleate +[123;)a] 
ent rv Bi — 7 VB") (Assuming rectilinearity and meso- 
rM, 2 k? Te GOSIS) sce a alee es eee [124] 
fa o (Assuming rectilinearity mesokurtosis and ho- 
MOCLISV:) ere aenceie wr cherish, Ukr a ee ene {124 a] 
r 
TPE ee (V Be —-1I-r VB’s = 1) 
2 or N 
(Assuming rectilinearity and homoscedasticity). . [125] 
r ; nei ae 
ee (Assuming mesokurtosis in addition to above)... .[125 a] 
2 




Let C= (M, = byMs). Then 
Mook» 
No* 20,05, 


Let $\beta_{12.3}$, $\beta_{13.2}$, etc., be defined as in Section 80. Then,


tobi, = — (Assuming rectilinearity and homoclisy) . . [126] 


I ; : 
tore = WE (712B13.2 + 713812.3) (Assumption of normality)... .[127] 
2 
Yria = % (B31. 2824.3 + B14.3832-1 + B13-4842-1 + Bar. 2823-4) 
(Assumption of normality)........ [128] 
A ae 71213 (R223 — 112 — 7713 + 2 ror isres) 
12713 23 2 Rok 719 
(CASStimM pULOMIOL notina lity, iene [129] 


The last three equations were first given by Filon and Pearson 
(1898). Formulas for a number of the preceding standard 
errors and correlations, not involving the assumption of 
normality, are given by Isserlis (1916). He also gives reduc- 
tion formulas for higher product moments, such for example 
as for Pryz. 


Section 50. FORMULAS FOR THE CALCULATION OF THE PRODUCT-MOMENT
COEFFICIENT OF CORRELATION
There are a number of useful variations of form in the 
product-moment formula. The equivalence of all the follow- 
ing statements should be immediately recognized by the 
student: 


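(a) $r_{12} = \frac{\sum z_1 z_2}{N}$, in which $z_1 = \frac{x_1}{\sigma_1}$ and $z_2 = \frac{x_2}{\sigma_2}$
(b) $r_{12} = \frac{\sum x_1 x_2}{N\sigma_1\sigma_2}$
(c) $r_{12} = \frac{\sum xy}{N\sigma_1\sigma_2}$ (Pearson product-moment coefficient of correlation) [90]
(d) $\sum xy = N r_{12}\sigma_1\sigma_2$
(e) $p_{11} = r_{12}\sigma_1\sigma_2$, or $r_{12} = \frac{p_{11}}{\sigma_1\sigma_2}$
(f) $r_{12} = b_{12}\frac{\sigma_2}{\sigma_1} = b_{21}\frac{\sigma_1}{\sigma_2}$
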
In case a table of squares is employed it is simpler to work 
with sums and differences than with products: Let d = the 
difference between two deviations, each taken from its mean. 
We have 

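$$\sigma_d^{2} = \frac{\sum d^{2}}{N} = \frac{\sum (x - y)^{2}}{N} = \frac{\sum x^{2}}{N} + \frac{\sum y^{2}}{N} - \frac{2\sum xy}{N}$$

$$= \sigma_1^{2} + \sigma_2^{2} - 2r\sigma_1\sigma_2$$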




$$r = \frac{\sigma_1^{2} + \sigma_2^{2} - \sigma_d^{2}}{2\sigma_1\sigma_2}$$ (Difference formula for r, based upon deviations from means) ............ [130]

In case x and y are equally variable, so that $\sigma_1 = \sigma_2 = \sigma$, we have,

$$r = 1 - \frac{\sigma_d^{2}}{2\sigma^{2}}$$ (Difference formula for r in case of equal variability, based upon deviations from means) .. [131]


Utilizing the usual relationship between a standard deviation 
around a mean and that around an arbitrary origin we may 
express the last two equations in terms of gross scores. Let $\Sigma_1$ = the standard deviation of the gross scores X around the origin, X = 0; $\Sigma_2$ that of the Y's, and $\Sigma_d$ that of the quantities (X - Y), and let $M_1$ and $M_2$ stand for the means, then the following formulas are easily derived from the preceding two.

$$r = \frac{\Sigma_1^{2} + \Sigma_2^{2} - \Sigma_d^{2} - 2M_1M_2}{2\sqrt{\Sigma_1^{2} - M_1^{2}}\,\sqrt{\Sigma_2^{2} - M_2^{2}}}$$ (Difference formula for r based upon gross scores) ........ [132]


In case the means, and standard deviations, are equal, — such 
a case as would arise if two similar forms of a test are correlated, 
the formula becomes 


$$r = 1 - \frac{\Sigma_d^{2}}{2(\Sigma^{2} - M^{2})}$$ (Difference formula for r based upon gross scores and in case means and standard deviations are equal) ....... [133]


The difference formula based upon gross scores may be trans- 
formed into one involving summations instead of averages. 
Let $S_1 = N\Sigma_1^{2}$, $S_2 = N\Sigma_2^{2}$, $S_d = N\Sigma_d^{2}$, $\sum X = NM_1$, $\sum Y = NM_2$. Then, we have,

$$r = \frac{\frac{N}{2}(S_1 + S_2 - S_d) - (\sum X)(\sum Y)}{\sqrt{NS_1 - (\sum X)^{2}}\,\sqrt{NS_2 - (\sum Y)^{2}}}$$ (Difference formula for r based upon sums of gross scores) ........ [134]


Formulas such as [134] involving gross scores only are advanta- 
geous in that they lend themselves readily to mechanical and 
routine calculation. The numerical figures involved frequently 
become large but this is not much of a handicap, if a table 
of squares is used, and if an adding machine is available. 
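
The routine character of such a gross-score calculation may be seen in a short sketch. The following Python is offered as an illustration only, assuming the difference formula in the form r = [(N/2)(S1 + S2 - Sd) - (ΣX)(ΣY)] / [sqrt(N S1 - (ΣX)^2) sqrt(N S2 - (ΣY)^2)]; the function name and the five pairs of scores are hypothetical.

```python
import math


def r_from_gross_score_sums(X, Y):
    """Correlation from sums of squares of gross scores and of their differences.

    S1, S2 and Sd are the sums of X^2, Y^2 and (X - Y)^2; no products XY
    are formed, so a table of squares and an adding machine would suffice.
    """
    n = len(X)
    s1 = sum(x * x for x in X)
    s2 = sum(y * y for y in Y)
    sd = sum((x - y) ** 2 for x, y in zip(X, Y))
    sum_x = sum(X)
    sum_y = sum(Y)
    numerator = n * (s1 + s2 - sd) / 2 - sum_x * sum_y
    denominator = (math.sqrt(n * s1 - sum_x ** 2)
                   * math.sqrt(n * s2 - sum_y ** 2))
    return numerator / denominator


# Hypothetical gross scores, for illustration only.
X = [1, 2, 3, 4, 5]
Y = [2, 1, 4, 3, 5]
r = r_from_gross_score_sums(X, Y)
```

For these scores S1 = S2 = 55 and Sd = 4, giving r = 40/50 = .8, the same value that the ordinary product-moment calculation yields.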
Formulas similar to certain of the preceding, based upon sums instead of differences, are as follows: Let $\sigma_s$ stand for the standard deviation of the sums of the deviations from the mean (x + y), and $\Sigma_s$ for the standard deviation from zero of the sums of gross scores (X + Y), and let other symbols be as above, then

$$r = \frac{\sigma_s^{2} - \sigma_1^{2} - \sigma_2^{2}}{2\sigma_1\sigma_2}$$ (Sum formula for r based upon deviations from means) ........ [135]

$$r = \frac{\sigma_s^{2}}{2\sigma^{2}} - 1$$ (Sum formula for r based upon deviations from means in case of equal variability) ........... [136]


Eliminating $\sigma^{2}$ from formulas [131] and [136] gives

$$r = \frac{\sigma_s^{2} - \sigma_d^{2}}{\sigma_s^{2} + \sigma_d^{2}}$$ (Sum and difference formula for r based upon deviations from means in case of equal variability) . [137]

If gross scores are used and if means, and standard deviations, are equal, formula [137] may be transformed into the following:

$$r = \frac{\Sigma_s^{2} - \Sigma_d^{2} - 4M^{2}}{\Sigma_s^{2} + \Sigma_d^{2} - 4M^{2}}$$ (Sum and difference formula for r based upon gross scores in case means, and standard deviations, are equal) ....... [138]

A general formula based upon the standard deviations of 
sums may be readily derived and is sometimes useful, as is 
also one based upon summations of sums. 

In general, if, for a given problem, certain relationships are
known to hold ahead of calculation, such, for example, as 
equal means, equal standard deviations, proportionate means, 
proportionate standard deviations, means or standard devia- 
tions having known values, etc., a simpler formula than the 
general one may be derived. If inexperienced help is doing 
the work, a mechanical routine method not involving such 
mental operations as multiplying three times seven, but rather 
such operations as copying 197244 and adding on an adding 
machine, is serviceable. If multiplication as high as twelve 
times twelve, and good judgment in selecting approximate 
means can be counted upon, the method used upon Galton’s 
data is probably the most expeditious. 




Section 51. THE INTERPRETATION OF REGRESSION
COEFFICIENTS


The derivation of the correlation coefficient shows it to be 
the regression coefficient in the case of standard measures. 
The regression coefficient is statistically the more fundamental 
and in all actual problems involving the estimate of one variable 
knowing a second, the regression coefficient and not the correlation coefficient is the essential measure. A wider use of regression coefficients in place of correlation coefficients would
lead to a more accurate and detailed understanding of the 
situations portrayed. We may illustrate this by the data 
of Chart XXI, but will first need to know the standard error 
of a difference. This is readily derived. Let d equal the difference between two measures X and Y, whose means are $M_1$ and $M_2$, and let x and y be defined by the equations $x = X - M_1$, $y = Y - M_2$, then

$$d = X - Y = (x - y) + (M_1 - M_2)$$

If any constant is added to or subtracted from d, the standard deviation around the mean is not altered so that

$$\sigma_d = \sigma_{(d + M_2 - M_1)}$$

and since

$$d + M_2 - M_1 = x - y$$

we have

$$\sigma_d = \sigma_{(x - y)}$$

but $\sigma_{(x - y)}$ is simply $\sigma_d$ of formula [130]. Solving [130] for $\sigma_d$ we have

$$\sigma_d = \sqrt{\sigma_1^{2} + \sigma_2^{2} - 2r_{12}\sigma_1\sigma_2}$$ (Standard error of the difference between 2 correlated measures) .... [139]

in which $\sigma_1$ is the standard error of the first measure, $\sigma_2$ of the second measure, and $r_{12}$ is the correlation between the two measures. In case the measures are not correlated we have

$$\sigma_d = \sqrt{\sigma_1^{2} + \sigma_2^{2}}$$ (Standard error of the difference between two independent measures) ........ [140]

The constants calculated from this chart, including the correlation ratio η and the test for linearity ζ, described in Section 68, are as given in Table XXXVI, in which variable one is the percentage of men voting for Thompson, and variable two, the percentage of women.


TABLE XXXVI

                                                  Standard errors of
M₁ = 60.768            M₂ = 60.558               M₁, .374     M₂, .441
σ₁ = 14.707            σ₂ = 17.354               σ₁, .264     σ₂, .312
b₁₂ = .73527           b₂₁ = 1.02377             b₁₂, .0107   b₂₁, .0149
r₁₂ = .86761                                     r₁₂, .0063
η₁₂ = .86942           η₂₁ = .87112              η₁₂, .0061   η₂₁, .0062
ζ₁₂ = η²₁₂ − r²₁₂      ζ₂₁ = η²₂₁ − r²₁₂         ζ₁₂, .0040   ζ₂₁, .0028


= .OooI! = 00314 
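
The standard errors of the means, standard deviations, regression coefficients, and correlation in Table XXXVI may be checked from the constants themselves. The following Python is a sketch only, assuming the large-sample forms σ/√N, σ/√(2N), (σ₁/σ₂)√((1 − r²)/N), and (1 − r²)/√N, and taking N = 1546 precincts from the note to Chart XXI; the function names are introduced here for illustration.

```python
import math

N = 1546                      # number of precincts (note to Chart XXI)
SIGMA_1, SIGMA_2 = 14.707, 17.354
R_12 = 0.86761


def se_mean(sigma, n):
    """Standard error of a mean, sigma / sqrt(N)."""
    return sigma / math.sqrt(n)


def se_sd(sigma, n):
    """Standard error of a standard deviation, sigma / sqrt(2N)."""
    return sigma / math.sqrt(2 * n)


def se_regression(sigma_dep, sigma_ind, r, n):
    """Standard error of the regression of the first-named variable on the second."""
    return (sigma_dep / sigma_ind) * math.sqrt((1 - r * r) / n)


def se_correlation(r, n):
    """Standard error of r, (1 - r^2) / sqrt(N)."""
    return (1 - r * r) / math.sqrt(n)


if __name__ == "__main__":
    print(round(se_mean(SIGMA_1, N), 3), round(se_mean(SIGMA_2, N), 3))
    print(round(se_sd(SIGMA_1, N), 3), round(se_sd(SIGMA_2, N), 3))
    print(round(se_regression(SIGMA_1, SIGMA_2, R_12, N), 4),
          round(se_regression(SIGMA_2, SIGMA_1, R_12, N), 4))
    print(round(se_correlation(R_12, N), 4))
```

Under these assumptions the computed values agree with the tabled .374, .441, .264, .312, .0107, .0149, and .0063.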




The method of calculating the standard error of ζ is given
later, but since its probable error is nearly as large as itself, 
rectilinearity is shown to be a sound assumption. Let us, 
therefore, consider the other constants and attempt to answer 
the following questions: 

(1) Is there a sex difference in regard to the mean tendency: 
that is, is the difference (M, — M2) which equals .210, one fifth 
of one per cent, a significant difference? 

CHART XXI*

CORRELATION BETWEEN SEX AND VOTING TENDENCIES

[Chart: correlation table with the two regression lines drawn. Abscissa: percentage of men voting for Thompson. Ordinate: percentage of women voting for Thompson.]



* Correlation of percentage of men’s votes cast for Thompson (abscissa) and percentage 
of women’s votes cast for Thompson (ordinate) in 1546 precincts in the Chicago municipal 
election of April 6, 1915. Percentages are of votes cast for the two leading candidates only.
Class intervals run from 4.5010 to 9.5000, etc., per cents. The middle of the class intervals 
are 7.0005, 12.0005, 17.0005, etc. The .0005 has been dropped in the calculations, and the 
class symbols are given as 7, 12, 17, etc. The number of votes per precinct did not differ 
greatly and ran about 400 per precinct, about 35 per cent being votes of women. The data 
were gathered from official returns by Professor J. W. Canning. 




(2) Is there a sex difference in regard to the variability of mean precinct votes: that is, is $\sigma_2 - \sigma_1 = 2.647$ a significant difference?

(3) Is there a sex difference in regression of mean precinct votes: that is, is $b_{21} - b_{12} = .28850$ a significant difference?

We can answer these questions by using formula [139] if we know (1) the correlation between means, (2) that between standard deviations, (3) that between regression coefficients. By formula [118]

$$r_{M_1M_2} = r_{12} = .8676$$

by formula [121]

$$r_{\sigma_1\sigma_2} = r_{12}^{2} = .7527$$

We have no formula for the direct calculation of the correlation between b's, but we do not need one. If the difference $b_{12} - b_{21}$ is significant, then the quotient, $b_{12}/b_{21}$, is significantly different from 1.00, but $b_{12}/b_{21} = \sigma_1^{2}/\sigma_2^{2}$. Therefore if $b_{12}/b_{21}$ is significantly different from 1.00, $\sigma_1/\sigma_2$ is also, but if this is so, then the difference $(\sigma_1 - \sigma_2)$ is significant. Accordingly if we prove that there is a significant difference between the two standard deviations, we have with the same certainty proven that there is a significant difference in the two regressions.

Letting $\sigma_d$ stand for the standard error of the difference of the measure under discussion, we have

$$M_1 - M_2 = .210$$

$$\sigma_d = \sqrt{(.374)^{2} + (.441)^{2} - 2(.8676)(.374)(.441)} = .219$$

$$\sigma_2 - \sigma_1 = 2.647$$

$$\sigma_d = \sqrt{(.264)^{2} + (.312)^{2} - 2(.7527)(.264)(.312)} = .207$$
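
The two calculations just made may be repeated mechanically. The following Python is a minimal sketch of formula [139] applied to the standard errors and correlations quoted above; the function name `se_difference` is introduced here for illustration.

```python
import math


def se_difference(se_1, se_2, r_12):
    """Standard error of the difference between two correlated measures.

    Formula [139]: sqrt(s1^2 + s2^2 - 2 r12 s1 s2); with r_12 = 0 it
    reduces to formula [140] for independent measures.
    """
    return math.sqrt(se_1 ** 2 + se_2 ** 2 - 2 * r_12 * se_1 * se_2)


# Difference between the means, .210, with r = .8676 between means.
se_diff_means = se_difference(0.374, 0.441, 0.8676)

# Difference between the standard deviations, 2.647, with r = .7527.
se_diff_sds = se_difference(0.264, 0.312, 0.7527)

ratio_means = 0.210 / se_diff_means   # about 1: not significant
ratio_sds = 2.647 / se_diff_sds       # about 12 or 13: clearly significant
```

The first ratio is close to unity and the second exceeds twelve, which is the basis of the conclusions drawn in the text.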


As the standard error of the difference between the means is 
equal to the difference, we cannot conclude that the difference 
is significant, but as the standard error of the difference between 
the standard deviations is but 1/12 of the difference, the point 
is definitely established that there is a sex difference resulting 
in difference in the standard deviations and in the regressions. 
In other words, on the average, throughout the city, men and 
women voted for Thompson to about the same extent, but 
judging by the precincts, the women tended to vote in blocks 
to a greater extent than men. If the precinct was a "Thompson precinct" the majority given to Thompson by the women was greater than that given by the men, and if it was an "anti-Thompson precinct," the majority against Thompson given
by the women was greater than that given by the men. One 
precinct in particular is a notable exception. This is the one 
recorded in row 12 and column 77 of Chart XXI. There 
existed in this precinct a very strong anti-Thompson women’s 
organization, with the result that though 77 per cent of the men 
voted for Thompson, only 12 per cent of the women did so. 
The two regression lines involved are drawn and the constants 
given in detail in order to point the significance of regression 
lines. That there is a correlation between the votes of men 
and women is of quite secondary interest to the fact that there 
is a wide difference in the regressions of the two sexes. The 
interpretation of the correlation table given hinges upon the 
slopes of regression lines in a much more fundamental sense 
than upon the value of the correlation. 


Section 52. PRODUCT-MOMENT CORRELATION OF NON-
RECTILINEAR DATA


We will now consider a problem involving the calculation of 
a Pearson product-moment coefficient of correlation from non- 
rectilinear data. I am indebted to Mr. H. A. Richmond for 
the accompanying problem and data. Each entry in Table 
XXXVII is for a single state, except the starred entry which 
is for the District of Columbia. From considerations alto- 
gether outside the data it seems appropriate to consider the 
District of Columbia data not to be homogeneous with the 
rest, and they are accordingly omitted from calculations. 

CHART XXII and CHART XXIII

[Charts: per capita insurance in force plotted against per cent white population.]




TABLE XXXVII 


PER CENT WHITE | PER CAPITA INSUR- | PER CENT WHITE | PER CAPITA INSUR-
POPULATION     | ANCE IN FORCE     | POPULATION     | ANCE IN FORCE

99 341 95 304
99 285 95 251
99 270 95 237
99 219 94 140
99 192 93 103
99 190 90 167
99 170 88 142 
99 — 224 87 105 
98 321 84 254 
98 290 83 207 
98 272 82 227 
98 269 82 101
98 253 78 133 
98 244 we *347 
98 241 71 — 96 
98 182 68 121 
98 171 67 158 
97 272 58 133 
97 234 yh 105 
97 204 56 126 
97 197 54 147 
97 182 44 m2 
96 237 43 84 
96 202 

96 190 

96 176 


Let X stand for the per capita insurance in force, and Y for the per cent white population, then calculation gives

$r_{12} = .6430$
$\eta_{12} = .7955$
Corrected for fineness of grouping error, $\eta_{12} = .7310$ (Calculation given in Section 68)
$\eta_{21} = .8019$
Corrected, $\eta_{21} = .7394$
$\zeta_{12} = \eta_{12}^{2} - r_{12}^{2} = (.7955)^{2} - (.6430)^{2} = .2193$
$\sigma_{\zeta_{12}} = .1202$ (Calculation by formula ...... [197])
$\zeta_{12}/\sigma_{\zeta_{12}} = 1.82$


so that (from Table K-W), the chances are 34 in 1000 that the 
true regression is rectilinear. The small population makes it 
impossible to prove the appropriateness of a certain regression 
line, rectilinear or otherwise, but with only one chance in 30 
of the regression being rectilinear, we will proceed on the assumption that it is definitely non-rectilinear. Since the
populations in the successive arrays are very small, the regres- 
sion line following all the chance fluctuations of the means of 
the arrays leads to a measure of correlation which is too large 
to represent the truth. Accordingly .64 is too small and .80 
too large, and the true regression is neither a straight line nor 
one following all the means of the arrays. A value in the 
neighborhood of .7394 is more trustworthy than either of these. 
As an empirical procedure, which will result in a more reason- 
able regression line, and a measure of correlation between 
.64 and .80, we may use a coarser and coarser grouping of 
percentages as the data deviate more and more from the 
mode, assign interval values to grouped data, and calculate a 
Pearson product-moment coefficient as shown in Chart XXIII. 
Percentage scores are transformed into auxiliary scores accord- 
ing to the following table: 


PER CENT OF WHITE | 43 | 54 | 64 | 73 | 81 | 88 | 94 | 
Pewee ING sey |) we: || wo || te, |) axe) |) ane) |} tKoy || Ley |) lsh || we) 


FOoLLows eee ele 3h OS) ee 2 CO! Ioan OA 
ASSIGN FOLLOWING 


ee eee | 2 SAS One Tale ee 


This transformation scheme is empirical but it should be 
noted that it has not been so drawn up as to capitalize chance 
fluctuations, thus giving a spuriously high measure of cor- 
relation. We are not endeavoring to secure a high measure 
of correlation such, for example, as the raw correlation ratio, 
but rather a reasonable measure; and second, we desire a 
procedure which permits estimating one variable, knowing 
the second, which the correlation ratio method does not permit. 
We may judge of the excellence of our transformation scheme 
by the approach of the resulting product-moment coefficient 
of correlation to the mean of the values of the two corrected 
correlation ratios (.7310 + .7394)/2 = .7352. With this auxil- 
iary score which bears a 1 to 1 relation with percentage of 
white population, the regression is practically rectilinear. The 
means of the arrays vary from a position on a straight line 
only to a degree which we may reasonably attribute to chance. 
Since there is a 1 to 1 relation between the auxiliary variable and per cent of white population, an estimation of the auxiliary
variable is equivalent to an estimate of the per cent of white 
population. The Pearson product-moment coefficient of cor- 
relation found between the auxiliary score and insurance in 
force is .7146 which, though it is not quite = .7352, the most 
reasonable value, is certainly an improvement upon either 
the straight correlation coefficient or the raw correlation ratio. 

In addition to enabling an estimate of one variable from a 
second, and to providing a reasonable measure of correlation, 
a reduction of one variable so as to yield a rectilinear regression 
with a second makes possible an investigation of multiple 
correlation tendencies which otherwise would be very laborious 
or altogether impossible. 

If we have three variables, $X_0$, $X_1$, $X_2$, and desire to know all the interrelations, we require information as to six regression lines which we may call $l_{01}$, $l_{10}$, $l_{02}$, $l_{20}$, $l_{12}$, $l_{21}$. Let us suppose that the correlation table involving variables 0 and 1 shows 2 rectilinear regressions, $l_{01}$ and $l_{10}$, and that the regression $l_{02}$ is curvilinear, and that the nature of the others has not been determined. Let us suppose that a simple transformation of $X_2$ scores into auxiliary $X'_2$ scores results in a rectilinear $l'_{02}$ regression line. Then as proven by Isserlis (1914), the additional regression lines $l'_{20}$, $l'_{12}$, and $l'_{21}$ are also rectilinear. The proposition may be stated in the words of Isserlis, who uses the word "linear" as we have used rectilinear; "We may conclude then that in general the linearity of any three of the six regression lines involves that of the remaining three." ... (Isserlis' theorem.)

Obviously the principle can be extended to any number of variables. Let $X_0$ be the dependent variable or the criterion, and let $X_1, X_2, X_3 \ldots X_n$ be independent variables which are combined into a single score for the purpose of estimating the criterion. Then, if each independent variable showing curvilinear regression with $X_0$ is transformed into auxiliary scores having rectilinear regression, not only every correlation with the criterion but every intercorrelation between the independent variables as well will be rectilinear. For example, given the four variables $X_0$, $X_1$, $X_2$, $X_3$. Let us suppose that none of the regressions are rectilinear. In this case the first investigation to make would be to see if a simple transformation of $X_0$ may not result in making all the regressions involving $X_0$ rectilinear. If no such transformation is possible, we may transform the scores of the independent variables. We have the curvilinear regression lines $l_{01}$, $l_{10}$; $l_{02}$, $l_{20}$; $l_{03}$, $l_{30}$; $l_{12}$, $l_{21}$; $l_{13}$, $l_{31}$; $l_{23}$, $l_{32}$. Probably a transformation of some one of the independent variables can be made so that both regression lines involving it and the criterion, that is $l_{01}$, $l_{10}$, or $l_{02}$, $l_{20}$, or $l_{03}$, $l_{30}$, become rectilinear. This is probably always possible in case of single valued functions. Rietz (1919) has shown the impossibility of accomplishing this in the case of multiple valued functions. Let us then so transform $X_1$, $X_2$, and $X_3$ that the following regression lines, $l'_{01}$, $l'_{10}$, $l'_{02}$ and $l'_{03}$ are rectilinear. Since $l'_{01}$, $l'_{10}$ and $l'_{02}$ are rectilinear, we know, by Isserlis' theorem, that $l'_{20}$, $l'_{12}$ and $l'_{21}$ must also be rectilinear, and since $l'_{01}$, $l'_{10}$ and $l'_{03}$ are rectilinear, $l'_{30}$, $l'_{13}$ and $l'_{31}$ are also, and since $l'_{02}$, $l'_{20}$ and $l'_{03}$ are rectilinear, $l'_{23}$ and $l'_{32}$ are also, completing the list. An extension of the method to n variables shows that for the practical purpose of estimating $X_0$ scores we may make empirical single valued transformations of the independent variables, wherever necessary to bring about rectilinear regression, and then proceed to calculate the multiple regression equation as described in the next chapter. Thus for single valued functions a lack of rectilinearity ordinarily constitutes no bar to multiple regression procedure.

We have, to this point, considered the significance of corre- 
lation as a measure of mutual implication and as a measure 
derived from the regression coefficient. This interpretation 
is to be looked upon as basic in correlation treatment. There 
are, however, other ways of interpreting it, which may oc- 
casionally be of value. Weldon (see Brown 1911) has related 
the correlation coefficient to the percentage of elements which 
are common to the two series of measures involved. Suppose 
standing in trait X depends upon the presence or absence of 
A + C independent elemental factors, and that standing in Y 
depends upon the presence of B + C independent elemental 
factors. The C factors are common to both X and Y. The 
A factors influence X alone and the B factors, Y alone. Further, 
suppose each factor is as likely to be present as absent, i.e., p = q = ½, and when present, to add one half to the trait score, and when absent, to subtract one half from it. Then x = A + C; y = B + C; and in the long run, $\sum A = \sum B = \sum C = 0$. Let $n_a$ equal the number of A factors, $n_b$ of B and $n_c$ of C factors, then

$$\sigma_A = \sqrt{n_a pq} = \tfrac{1}{2}\sqrt{n_a}$$

$$\sigma_B = \tfrac{1}{2}\sqrt{n_b}$$

$$\sigma_C = \tfrac{1}{2}\sqrt{n_c}$$

$$\sigma_{A+C} = \tfrac{1}{2}\sqrt{n_a + n_c}$$

$$\sigma_{B+C} = \tfrac{1}{2}\sqrt{n_b + n_c}$$

$$Nr\sigma_1\sigma_2 = \sum xy = \sum (A + C)(B + C) = \sum AB + \sum AC + \sum BC + \sum C^{2} = \sum C^{2} = N\sigma_C^{2}$$

since, by supposition, all the elements are independent, all summations of products equal zero. Accordingly

$$r = \frac{n_c}{\sqrt{n_a + n_c}\,\sqrt{n_b + n_c}}$$

If the number of elements determining the score in X equals the number determining that in Y, $n_a = n_b$ and we have

$$r = \frac{n_c}{n_a + n_c}$$

or, the correlation coefficient is the proportion of elements common to the two traits.

Again, suppose trait X is determined by $n_c$ elements and that trait Y is determined by these plus $n_b$ additional ones, that is, $n_a = 0$, then

$$r = \frac{n_c}{\sqrt{n_c}\,\sqrt{n_b + n_c}}$$

and

$$r^{2} = \frac{n_c}{n_b + n_c}$$

or, the square of the correlation coefficient is the proportion of 
elements determining X which are involved in Y. We of course 
do not know that traits or scores are due to summations of 
independent elements, so that these results at best have rather 
doubtful interpretive value, whereas, the interpretation of cor- 
relation in terms of regression never fails. Thomson (1919) 
and Brown and Thomson (1921) deal very fully with this 
subject. 
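
The common-elements result may be illustrated by simulation. The following Python is a sketch only, assuming independent factors each present or absent with probability one half and contributing plus or minus one half to the score, as supposed above; the function names, the seed, and the numbers of factors are hypothetical choices.

```python
import math
import random


def pearson_r(xs, ys):
    """Product-moment correlation from deviations around the means."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def expected_r(n_a, n_b, n_c):
    """r = n_c / (sqrt(n_a + n_c) * sqrt(n_b + n_c))."""
    return n_c / math.sqrt((n_a + n_c) * (n_b + n_c))


def simulate_r(n_a, n_b, n_c, people=20000, seed=1):
    """Draw X = A + C and Y = B + C for many individuals and correlate them."""
    rng = random.Random(seed)

    def total(k):
        # Sum of k independent factors, each adding +1/2 or -1/2.
        return sum(rng.choice((-0.5, 0.5)) for _ in range(k))

    xs, ys = [], []
    for _ in range(people):
        a, b, c = total(n_a), total(n_b), total(n_c)
        xs.append(a + c)
        ys.append(b + c)
    return pearson_r(xs, ys)


if __name__ == "__main__":
    # Six factors peculiar to each trait and four common: expected r = 4/10.
    print(expected_r(6, 6, 4), simulate_r(6, 6, 4))
```

With equal numbers of peculiar factors the simulated r should fall near the proportion of common elements, here .4, within sampling error.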




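It has been assumed that the limits of the coefficient of correlation are -1 and 1. This may easily be proven. Let

$$z_1 = \frac{x_1}{\sigma_1} \quad\text{and}\quad z_2 = \frac{x_2}{\sigma_2}$$

then $\sigma_{z_1} = 1.00$ and $\sigma_{z_2} = 1.00$

$$(z_1 - z_2)^{2} \geq 0$$

$$\frac{1}{N}\sum (z_1 - z_2)^{2} = \frac{1}{N}\sum z_1^{2} + \frac{1}{N}\sum z_2^{2} - \frac{2}{N}\sum z_1 z_2 = 1 + 1 - 2r$$

but

$$\sum (z_1 - z_2)^{2} \geq 0$$

therefore

$$2(1 - r) \geq 0 \quad\text{or}\quad r \leq 1$$

Thus the upper limit of r is +1.

$$\frac{1}{N}\sum (z_1 + z_2)^{2} = 2(1 + r) \geq 0 \quad\text{or}\quad r \geq -1$$

Thus the lower limit of r is -1. Accordingly all values of r lie between -1 and 1.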


Section 53. THE RANK METHOD OF CALCULATING
CORRELATION


The product-moment method of calculating correlation may 
be used when differences in merit are expressed in ranks and 
not in graded scores. Formula [130] is the most convenient 
to use in deriving the expression for the coefficient of correla- 
tion when ranks are used. 

The standard deviation of the ranks in the one trait equals $\sigma_1$, and of course equals the standard deviation in the other trait, $\sigma_2$, as the number of ranks is the same in the two cases. It should, however, be noted that if scores such as


OS 40 ee OO nS 7.) 05401551 85-818 S075 
are assigned ranks 

Pee STENT 5 he TNT Oe fOs It 
the standard deviation of these pseudo ranks is not identical with that of ranks 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11. Only slight error is introduced in case ranks are but occasionally divided between two paired measures, but if there are many individuals all given the same rank decided error is present.




Since the standard deviations are equal the equation becomes, using ρ in place of r as is customary when dealing with ranks:

$$\rho = 1 - \frac{\sum d^{2}}{2N\sigma^{2}}$$

The $\sum d^{2}$ is to be determined by recording the differences in ranks of the individuals in the two traits, squaring and summing. The common standard deviation, σ, may be found from the number of ranks, which is also N, the population. It is only necessary, therefore, to determine the standard deviation of the series 1, 2, 3, ... N around its own mean. We have

$$M = \frac{1 + 2 + 3 + \cdots + N}{N} = \frac{N + 1}{2}$$

$$\mu_2 = \frac{1^{2} + 2^{2} + 3^{2} + \cdots + N^{2}}{N} - \left(\frac{N + 1}{2}\right)^{2} = \frac{N^{2} - 1}{12}$$


This value for $\mu_2$ may be obtained by first determining the
second moment, $m_2$, in case the distribution consists of frequencies
evenly spread over the class intervals, as indicated
in the accompanying figure, instead of being concentrated at
the class indexes or mid-points as is the case when measures
of rank position are used.

[Figure: the rectangular frequency distribution y = 1 drawn over the ranks 0, 1, 2, 3, 4, 5, ... k, ... N.]

The frequency distribution drawn
is represented by the line y = 1 and extends from x = 1/2 up
to x = N + 1/2. The second moment from 0 of any one rank,
let us say the k'th, is $k^2$, whereas the second moment of the
distribution y = 1 from (k - 1/2) to (k + 1/2) is given by the
equation

$$\int_{k - \frac{1}{2}}^{k + \frac{1}{2}} x^2 \, dx = k^2 + \frac{1}{12}$$

The moment of the frequency y = 1 corresponding to this
k'th rank, 1/N of the population, is 1/12 too large, as is of
course the case for every other rank; hence the second moment
of the equation y = 1 from x = 1/2 to x = N + 1/2 will be larger
than the desired second moment by

$$\frac{N}{N}\left(\frac{1}{12}\right) = \frac{1}{12}$$




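That is, writing $\mu'_2$ for the second moment of the ranks from 0,

$$m_2 = \mu'_2 + \frac{1}{12}$$

$$m_2 = \frac{1}{N}\int_{\frac{1}{2}}^{N + \frac{1}{2}} x^2 \, dx = \frac{4N^2 + 6N + 3}{12}$$

Therefore

$$\mu'_2 = \frac{4N^2 + 6N + 2}{12}$$

$$\mu_2 = \mu'_2 - M^2 = \frac{N^2 - 1}{12}$$
(The second moment of N ranks) ...[141]

$$\rho = 1 - \frac{6\sum d^2}{N(N^2 - 1)}$$
(Spearman's formula for the coefficient of correlation
calculated from ranks) ...[142]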
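
As a check on formula [142], the following minimal sketch computes $\rho$ from the squared rank differences and compares it with the product-moment coefficient computed directly from the same ranks. The function names and the six pairs of untied ranks are illustrative assumptions, not data from the text. The two coefficients should agree, and the variance of the ranks should equal $(N^2 - 1)/12$.

```python
import math

def rho_from_ranks(rank_x, rank_y):
    """Formula [142]: rho = 1 - 6*sum(d^2) / (N*(N^2 - 1)), for untied ranks."""
    n = len(rank_x)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))

def variance(values):
    """Second moment about the mean, dividing by N as the text does."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n

def product_moment(x, y):
    """Ordinary product-moment correlation of two equally long series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cross = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return cross / math.sqrt(variance(x) * variance(y))

# Hypothetical ranks of six individuals in two traits (no ties).
rank_x = [1, 2, 3, 4, 5, 6]
rank_y = [2, 1, 4, 3, 6, 5]
n = len(rank_x)

print(variance(rank_x), (n * n - 1) / 12)
print(rho_from_ranks(rank_x, rank_y), product_moment(rank_x, rank_y))
```

The agreement holds only for serial ranks 1 to N; with tied or averaged ranks the variance is no longer $(N^2 - 1)/12$ and the two results part company, as the text warns.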


This formula should not be confused with Spearman’s foot 
rule formula for correlation 


$$R = 1 - \frac{6\sum G}{N^2 - 1}$$
(Spearman's foot rule formula for correlation based upon the sum
of the gains in rank) ........[143]


which has a large, though, except in the case of zero corre- 
lation, not definitely known probable error; does not vary 
between — 1 and +1; is not at all comparable in meaning 
with a product-moment coefficient; and in general has none 
of the merits except brevity, of the formula based on the 
squares of differences in rank. The coefficient calculated by 
formula [142] is usually designated by $\rho$, but it should be noted
that it is identical with r if ranks constitute the scores.

Pearson has shown that if scores in the two traits which are
in truth normal in form are assigned ranks and $\rho$ calculated,
it will differ slightly from the r obtained directly from the
scores. To allow for this discrepancy, $\rho$'s may be turned into
r's by the formula,

$$r = 2\sin\left(\frac{\pi}{6}\rho\right)$$
(Pearson's correction to Spearman's $\rho$) ..[144]


That the correction is of small magnitude is shown by the 
accompanying table: 


TABLE XXXVIII

    ρ       r           ρ       r

   .00    .000         .60    .618
   .10    .105         .70    .717
   .20    .209         .80    .813
   .30    .313         .90    .908
   .40    .416         .95    .954
   .50    .518        1.00   1.000
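
The tabled values follow directly from Pearson's correction, $r = 2\sin(\pi\rho/6)$. A minimal sketch (the function name is an assumption made for illustration) that recomputes them:

```python
import math

def r_from_rho(rho):
    """Pearson's correction to Spearman's rho: r = 2*sin(pi*rho/6)."""
    return 2 * math.sin(math.pi * rho / 6)

# Recompute r for the values of rho listed in the table.
for rho in (0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95, 1.0):
    print(f"{rho:.2f}  {r_from_rho(rho):.3f}")
```

The largest difference between $\rho$ and r in this range is about .018, which bears out the remark that the correction is small.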




The formula for $\rho$ is the best of the rank formulas, but in case
scores constitute the basic data there is always some loss in
accuracy from warping the data into ranks. The probable
error of $\rho$ as determined by Pearson (1907 further) is

$$\text{P.E.}_{\rho} = .7063\,\frac{1 - \rho^2}{\sqrt{N}}$$
(Probable error of $\rho$) ..[145]

or approximately 5 per cent greater than the probable error
of r.

In case one of the variables is given in terms of ranks and
the other in terms of variates, we may assign rank values to
the variates and use formula [142]. If the grouping in the
variate series is coarse, ranks cannot be assigned without
losing much of the refinement of the variate data, and if the
average of a number of ranks is assigned to all the measures in
one class there is a further error if formula [142] is used as
this formula presupposes serial ranks from 1 to N.

To obviate these difficulties it is better to calculate the
product-moment coefficient of correlation between the ranks
on the one hand and the variates on the other. Let us call
this $\rho'$, and let r be the correlation if the two series could each
be expressed in terms of variates and if they constitute a normal
correlation surface. Then Pearson (1914, ext.) has shown
that,

$$r = \sqrt{\frac{\pi}{3}}\;\rho'$$
(To deduce r from $\rho'$, the product-moment
correlation between a variate series and
a rank series) ........[146]
or
$$r = 1.0233\,\rho'$$
PROBLEMS 


1. Plot the correlation table giving the correlation between the Thorn- 
dike and Ayres scores in handwriting given in Table XXX, Section 34, and 
answer the question, ‘‘Is the relationship between the two variables rec- 
tilinear?’’ Ans. It is. 


2. Calculate the correlation between series 1 and 2, between series 1
and 3, and also between series 2 and 3 of the paired practice series given in
problem 3, Chapter III.

3. Calculate the standard error of $r_{12}$, the correlation between series 1
and series 2 (a) by formula [108 b], (b) by formula [108 a], (c) by formula
[108 c] and finally, as the most accurate method of all, (d) by formula
[108 a] using in addition [108 d].




4. Rank the measures in these three series and calculate the correlations
$\rho_{12}$, $\rho_{13}$ and $\rho_{23}$ by formula [142].


5. Determine for the first two of these three series the regression equation
for estimating variable 1 from variable 2 and calculate the standard
errors of the two constants, $b_{12}$ and c, involved.


6. In the derivation of $b_{12}$ it was assumed that the regression line passed
through the means of the two distributions. Derive the same value as
$b_{12}$ without making this assumption.


CHAPTER IX 
FUNCTIONS INVOLVING CORRELATED MEASURES 


Section 54. CORRELATIONS OF SUMS OR AVERAGES 


If the basic means and standard deviations of several series 
of measures and the correlations between series are known, the 
means, standard deviations and correlation of any weighted 
average or sum of these measures with a second weighted sum 
may be determined (Spearman, 1913). Given the several 


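series (a + b) in number, $X_1, X_2, \ldots X_a, X_{a+1}, X_{a+2}, \ldots X_{a+b}$,
with means $M_1, M_2, \ldots M_{a+b}$, standard deviations
$\sigma_1, \sigma_2, \ldots \sigma_{a+b}$, and intercorrelations
$r_{12}, r_{13}, \ldots r_{1(a+b)}, r_{23}, \ldots$,
let the standard measures for these variables be, as usual,

$$z_1 = \frac{X_1 - M_1}{\sigma_1}, \quad z_2 = \frac{X_2 - M_2}{\sigma_2}, \quad \text{etc.}$$

If a of the measures are combined by adding into a single
score, and if the remaining b measures, distinguished here by
the Roman subscripts I, II, ... b, are also combined, the
correlation between the two composites is

$$r_{(1+2+\cdots a)(I+II+\cdots b)} = \frac{\sum (z_1 + z_2 + \cdots z_a)(z_I + z_{II} + \cdots z_b)}{\sqrt{\sum (z_1 + z_2 + \cdots z_a)^2}\;\sqrt{\sum (z_I + z_{II} + \cdots z_b)^2}}$$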


The product of the two terms in parentheses in the numerator
gives a polynomial of ab terms each of which is a sum of the sort
$\sum z_1 z_I$, but

$$\sum z_1 z_I = N r_{1I}, \quad \sum z_1 z_{II} = N r_{1\,II}, \quad \text{etc.}$$

Accordingly the numerator equals $N \overset{ab}{\underset{1}{S}}\, r_{pQ}$. The symbol
$\overset{ab}{\underset{1}{S}}$ stands for a double summation, p taking in turn the values in
the series from 1 to a, and Q in turn the values from I to b.

The square of the first polynomial in the radical in the denominator
gives a polynomial of $a^2$ terms, a of them being of
the sort $\sum z^2$, and the balance $(a^2 - a)$ of the sort $\sum z_1 z_2$. But

$$\sum z_1^2 = N, \quad \sum z_2^2 = N, \quad \text{etc.}$$
$$\sum z_1 z_2 = N r_{12}, \quad \sum z_1 z_3 = N r_{13}, \quad \text{etc.}$$

Further

$$\sum z_1 z_2 = \sum z_2 z_1$$

and as both of these occur in the summation, there are but
$(a^2 - a)/2$ different product sums involved, though each of
these is found twice. Accordingly the magnitude under the
first radical equals

$$N \overset{a}{\underset{1}{S}}\, 1 + N \overset{(a^2 - a)}{\underset{1}{S}}\, r_{pq}$$

in which $\overset{a}{\underset{1}{S}}\, 1$ is simply 1 added a times so that
$\overset{a}{\underset{1}{S}}\, 1 = a$; $\overset{(a^2 - a)}{\underset{1}{S}}\, r_{pq}$
is a double summation in which p takes all values from 1 to a,
and q all values other than p from 1 to a. Thus again each
r occurs twice, once as $r_{pq}$ and once as $r_{qp}$. But an r with
repeated subscript, such as $r_{pp}$, is not found in the summation.
The summation under the second radical is similar in type, so
that

$$r_{(1+2+\cdots a)(I+II+\cdots b)} = \frac{\overset{ab}{\underset{1}{S}}\, r_{pQ}}{\sqrt{a + \overset{(a^2 - a)}{\underset{1}{S}}\, r_{pq}}\;\sqrt{b + \overset{(b^2 - b)}{\underset{1}{S}}\, r_{PQ}}}$$
(Correlation between sums or averages of scores) .[147]
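
Formula [147] can be verified numerically. The sketch below uses four short hypothetical score series (invented for illustration, as are the function names), sums the first two and the last two in standard measure, and compares the correlation of the two composites computed directly with the value given by the formula from the intercorrelations alone.

```python
import math

def standardize(xs):
    """Express a series in standard measure (deviation from mean / sigma)."""
    n = len(xs)
    mean = sum(xs) / n
    sigma = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)
    return [(x - mean) / sigma for x in xs]

def corr(xs, ys):
    """Product-moment correlation: mean product of standard measures."""
    zx, zy = standardize(xs), standardize(ys)
    return sum(a * b for a, b in zip(zx, zy)) / len(xs)

def composite_r(R, first, second):
    """Formula [147] from a correlation matrix R and two lists of indexes."""
    cross = sum(R[p][q] for p in first for q in second)
    # Each within-group sum includes the diagonal 1's: a + S r_pq.
    within_first = sum(R[p][q] for p in first for q in first)
    within_second = sum(R[p][q] for p in second for q in second)
    return cross / math.sqrt(within_first * within_second)

# Hypothetical scores of six individuals on four measures.
scores = [
    [3, 5, 2, 8, 7, 4],
    [4, 6, 1, 7, 9, 5],
    [2, 7, 3, 6, 8, 3],
    [5, 4, 2, 9, 6, 6],
]
R = [[corr(a, b) for b in scores] for a in scores]
z = [standardize(s) for s in scores]
first, second = [0, 1], [2, 3]

by_formula = composite_r(R, first, second)
sum_first = [z[0][i] + z[1][i] for i in range(6)]
sum_second = [z[2][i] + z[3][i] for i in range(6)]
direct = corr(sum_first, sum_second)
print(by_formula, direct)
```

The two printed values agree because the formula is an algebraic identity in the observed intercorrelations, not an approximation.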


The preceding formula may readily be generalized so as to
apply when gross weighted scores are combined. Let $w_1$ be
the weight of $X_1$, $w_2$ of $X_2$, etc. Then we desire the correlation
between $(w_1X_1 + w_2X_2 + \cdots w_aX_a)$ and
$(w_IX_I + w_{II}X_{II} + \cdots w_bX_b)$ which may be represented by the symbol
$r_{(Sw_pX_p)(Sw_PX_P)}$.

In calculating the correlation, each variable must be expressed
as a deviation from its own mean. Accordingly
$(w_1M_1 + w_2M_2 + \cdots w_aM_a)$ must be subtracted from the first summation
variable. This leaves $(w_1x_1 + w_2x_2 + \cdots w_ax_a)$. Similarly
for the second summation variable. Proceeding as before
we have in place of $\sum z_1^2$ the expression

$$\sum (w_1x_1)^2$$

and in place of $\sum z_1z_2$ the expression

$$\sum w_1x_1w_2x_2$$

so that finally we obtain

$$r_{(Sw_pX_p)(Sw_PX_P)} = \frac{\overset{ab}{\underset{1}{S}}\, w_p\sigma_p w_Q\sigma_Q r_{pQ}}{\sqrt{\overset{a}{\underset{1}{S}}\, w_p^2\sigma_p^2 + \overset{(a^2 - a)}{\underset{1}{S}}\, w_p\sigma_p w_q\sigma_q r_{pq}}\;\sqrt{\overset{b}{\underset{1}{S}}\, w_P^2\sigma_P^2 + \overset{(b^2 - b)}{\underset{1}{S}}\, w_P\sigma_P w_Q\sigma_Q r_{PQ}}}$$
(Correlation between the sums or averages
of weighted scores) ........[148]


Note that there is nothing in the derivation to prevent certain 
of the weights being negative. If the correlation between two 
series is 7, this is not changed when all the measures in the first 
series are divided by a certain quantity and all those in the 
second by another. Thus in the preceding, division of the 
first series by a and of the second by b, leading to averages,
will not change the correlation. The formula given is there- 
fore equally applicable whether dealing with sums or with 
averages. 

In case a single score is correlated with the weighted average
of a number of others we have a situation represented by one
of the two sums having but one item in it. Then the summation
$\overset{b}{\underset{1}{S}}$ has but a single term and $\overset{(b^2 - b)}{\underset{1}{S}}$ has no terms. Further,
$w_I\sigma_I$ cancels from numerator and denominator of the right
hand member. This is the very common situation where one
variable, which we may call the criterion and represent by $X_0$,
is taken as a standard and all the others are combined so as to
give a high correlation with this one. Under these conditions
formula [148] becomes:

$$r_{x_0(Sw_pX_p)} = \frac{\overset{a}{\underset{1}{S}}\, w_p\sigma_p r_{0p}}{\sqrt{\overset{a}{\underset{1}{S}}\, w_p^2\sigma_p^2 + \overset{(a^2 - a)}{\underset{1}{S}}\, w_p\sigma_p w_q\sigma_q r_{pq}}}$$
(Correlation between a criterion and the weighted sum or
average of a number of scores) ........[149]


Since this formula gives the correlation whatever the $\sigma w$
products, or the effective weights, may be, one may frequently
by successive trials hit upon a weighting which gives a fairly
satisfactory correlation. If two independent variables are involved
and the nominal weight of the first independent variable
is arbitrarily set equal to 1.0 while that of the second is
indeterminate and called w we have

$$r_{x_0(X_1 + wX_2)} = \frac{\sigma_1 r_{01} + w\sigma_2 r_{02}}{\sqrt{\sigma_1^2 + w^2\sigma_2^2 + 2r_{12}\sigma_1 w\sigma_2}}$$

The multiple correlation $r_{x_0(X_1 + wX_2)}$ and the weight w are
the only unknowns in this equation, so it may be plotted on
two axes, w the abscissa and r the ordinate, throwing into clear
relief the effect of approximate weightings. Thurstone (1919)
has shown the value of this procedure. A plot of the following
data will illustrate the falling off in the multiple correlation
obtained as w varies from $-.9310$, which is the ratio of the
regression coefficients $b_{02.1}/b_{01.2}$. Given $r_{01} = .4$, $r_{02} = -.3$,
$r_{12} = .6$, $\sigma_1 = \sigma_2$:

If w =                      -∞      -2.0    -1.5    -1.0    -.9310   -.9
Then r_{x0(X1+wX2)} =      .300    .620    .706    .7826    .7846   .7842

If w =                     -.8     -.5      0       1.0     1.333    2.0     ∞
Then r_{x0(X1+wX2)} =      .776    .682    .400    .056     .000   -.074   -.300

Returning to [149], in case all of the series summated or
averaged have equal standard deviations and are given equal
weight, we have:

$$w_1 = w_2 = \cdots w_a = w$$
$$\sigma_1 = \sigma_2 = \cdots \sigma_a = \sigma$$

$$\overset{a}{\underset{1}{S}}\, w_p\sigma_p r_{0p} = w\sigma \overset{a}{\underset{1}{S}}\, r_{0p} = a w \sigma\, \bar{r}_{0p},$$

where $\bar{r}_{0p}$ is the average correlation of the various series with the
criterion $x_0$.

$$\overset{(a^2 - a)}{\underset{1}{S}}\, w_p\sigma_p w_q\sigma_q r_{pq} = w^2\sigma^2 \overset{(a^2 - a)}{\underset{1}{S}}\, r_{pq} = w^2\sigma^2 (a^2 - a)\, \bar{r}_{pq}$$

where $\bar{r}_{pq}$ is the average intercorrelation between the several
original series so that, finally,

$$r_{x_0(Sw_pX_p)} = r_{x_0(SX_p)} = \frac{a\,\bar{r}_{0p}}{\sqrt{a + (a^2 - a)\,\bar{r}_{pq}}}$$

or,

$$r_{x_0(Sw_pX_p)} = \frac{\bar{r}_{0p}}{\sqrt{\dfrac{1}{a} + \dfrac{a - 1}{a}\,\bar{r}_{pq}}}$$
(Correlation between a criterion and the sum or
average of a number of equally weighted scores) ..[151]

If the tests are comparable the several correlations with the
criterion differ but little and any one of them may be taken as a
first approximation to $\bar{r}_{0p}$, and the intercorrelations differ but
little and any one of them is a first approximation to $\bar{r}_{pq}$; also
$Sw_pX_p = af$ as defined in the next section (55), so that we
have

$$r_{0(af)} = \frac{r_{01}}{\sqrt{\dfrac{1}{a} + \dfrac{a - 1}{a}\, r_{1I}}}$$
(Correlation between a criterion and the sum or average of a
number of equally weighted similar test scores) ..........[152]


The effective weight given a test is not $w_p$, the nominal
weight, but $w_p\sigma_p$, the product of the nominal weight and the
standard deviation of the scores. Accordingly equally weighted
scores are those in which the products of the nominal weights
and the standard deviations are equal; that is, if $w_1\sigma_1 = w_2\sigma_2
= w_3\sigma_3 = \cdots$, etc., the $X_1$, $X_2$, $X_3$, etc., series or scores are
actually weighted equally. This is the condition that must
hold if the immediately preceding formula is to remain true.


Section 55. THE RELIABILITY COEFFICIENT


Let us suppose that the scores combined are those of comparable
tests of some single function. If the tests are strictly
comparable, then in addition to the means, and standard
deviations being equal

$$r_{01} = r_{02} = r_{03} = \cdots = r_{0a}$$

and

$$r_{12} = r_{13} = r_{23} = \cdots = r_{1I},$$

the correlation between one form or test and a second similar
form. Let us define a "true score" as the average score on
an infinite number of strictly comparable tests. Then the correlation
between the criterion and such a true score, which
can be obtained by letting a of formula [150] become infinite,
may be written as

$$r_{0\omega} = \frac{r_{01}}{\sqrt{r_{1I}}}$$
(Correlation between a fallible criterion and a true score) ...[153]

in which $\omega$ designates the infinite summation. If the reliability
coefficient of the criterion $r_{00}$ is known, we have, as the
same sort of formula as [153]

$$r_{\infty 1} = \frac{r_{01}}{\sqrt{r_{00}}}$$
(Correlation between a true criterion and a fallible score) [153 a]


The correlation $r_{01}$ is that between a test and a criterion, and
$r_{1I}$ is that between two comparable tests and is called a reliability
coefficient. That the notation may be entirely clear, the
meanings of several symbols as they will be used are here
listed. $r_{af,Af}$ is the correlation between the sum, or average,
of a measures of a certain sort and A others of the same sort.
Capital A is used in the second subscript instead of small a to
indicate that the second series of tests (the same in number
as in the first series) is different, though similar to the a tests
averaged or summated in the first series. Whenever a is
greater than one, the f is kept in the subscript, but when a
single test is correlated with a single other test, it is dropped,
and the subscript designates the variable. Thus $r_{2f,IIf}$ means
that an average or sum of two forms of the test (or average or
sum of two comparable measures of whatever sort they may
be) is correlated with the average or sum of two other comparable
forms and $r_{2II}$ means that one form of the test 2 is
correlated with a second similar form of the same test. In this
latter case 2 refers to the variable, whereas in the former case
(2f) the 2 refers to the number of forms averaged or summed.
The symbol $r_{11}$ represents the correlation between retestings
with the same form. If the variable $X_1$ is a test score the only
reason $r_{11}$ does not equal 1.0 is that there is a time interval
between the two answers which an individual gives to the
same question. Similarly $r_{af,af}$ means the correlation between
average scores upon re-testing with the same a forms.
Certain very specific conditions need to hold before two 
tests may be considered comparable, and therefore before a 
correlation between two tests can be considered a reasonable 
reliability coefficient. In educational and psychological test- 
ing the first of two similar tests frequently calls forth a response 
which is different from the second. The greater familiarity 
with the form of the test or the difference in interest aroused 




may make the second test quite different from the first. This 
would be especially true if certain elements in the first were so 
similar to elements in the second as to lead to what may be 
called a memory transference from the first test to the second. 
For example, suppose the following questions occur in the first 
and second tests respectively: 

"(a) John is taller than James and James is as tall as Joe.
Joe is shorter than Jack. How do John and Jack compare in
height?"

"(b) Bessie is brighter than Bertha, and Bertha is just as
bright as Beula. Beula is not quite as bright as Beatrice.
Which is the brighter, Beatrice or Bessie?"

One would expect memory transference, and a tendency to 
solve the second in the same way as the first. We may call 
such a situation one in which there is a correlation between 
errors, meaning that, whatever elements of uncertainty or 
chance operated in the solution of the first question, they 
would tend to operate in the same manner in the solution of 
the second. This situation would tend to make $r_{1I}$ too high
as a true measure of reliability. There are other, and usually 
more important, factors which operate in the other direction. 
Let us suppose the two following questions occur in two forms
and that they are intended to be comparable: "(a) Who was
the first president of the United States?" and "(b) Who was
the leading batter in the American League in 1920?" Passing
over the possibility of some other question than (a) in the first 
test being comparable to (b) and some other than (b) in the 
second test being comparable to (a), let us consider the com- 
parability of the two questions given. There is certainly no 
memory transference which would help or hinder in answering 
(b) after having answered (a), but the ability to answer (a) 
probably tests special capacity or knowledge which is quite 
different from that demanded for the correct answering of (b). 
In other words (a) and (b) are not samplings of the same 
capacity and two tests made up of questions no more similar 
than (a) and (b) can hardly be considered comparable, and as 
a consequence they would lead to an $r_{1I}$ which would be too
small. This is the situation which is the more likely and the
more serious as $r_{0\omega}$ in this case becomes too large. The
errors of interpretation due to a too large estimated correlation
between a test score and a criterion are probably in general
more serious than those due to a too small estimated correlation.
The following rule for the construction of two comparable 
tests may be laid down: (1) sufficient fore-exercise should be 
provided to establish an attitude or set, thus lessening the 
likelihood of the second test being different from the first, due 
to a new level of familiarity with the mechanical features, etc.; 
(2) the elements of the first test should be as similar in difficulty 
and type to those in the second, pair by pair, as possible; 
but, (3) should not be so identical in word or form as to com- 
monly lead to a memory transfer or correlation between errors. 
It is obvious that condition (3) is not met if a test is merely 
repeated. Only in case the repetition be at so remote a time 
from the first test that no memory of the earlier response could 
influence the later would there be no correlation between 
errors — in fact even were there no conscious memory of the 
earlier situation there might be a subconscious influence result- 
ing in correlation between the errors. Accordingly the repeti- 
tion of a test to secure a reliability coefficient is to be deprecated. 
However, the repetition of a test to secure an upper limit or 
maximum value above which the true reliability coefficient 
will not lie may be considered to be a sound procedure. 
Spearman (1904 and 1907), who introduced the term "reliability
coefficient," used it as here to designate $r_{1I}$, the correlation
between comparable tests, and Brown (1911) used the
term to mean $r_{11}$, the correlation between repeated tests. This
is an unfortunate vitiating of the Spearman concept, particularly
in view of the fact that a reliability coefficient in the
Spearman, and not in the Brown, sense, is the one needed in
all the formulas leading to an estimation of true correlation.*
It has been pointed out that the correlation between repeated
tests constitutes an upper limit of the reliability coefficient,
while the correlation between two forms meeting condition (3),
but not fully meeting condition (2), would constitute a lower
limit. Should these two correlations lie close together probably
an average of them would constitute a close approximation
to the true reliability coefficient. We may expect in most
mental and educational test work that the true reliability
coefficient will be less than the obtained $r_{11}$ and greater than
the obtained $r_{1I}$. The lack of fulfillment of condition (1) for
certain age groups and with certain tests probably at times
leads to too high a reliability coefficient and at other times to
one which is too low.

* The unfortunate use of $r_{11}$ as a reliability coefficient given in Brown (1911) is corrected
in the later edition as Brown and Thomson (1921) define $r_{1I}$ as here used to be the reliability
coefficient.


Section 56. CORRECTION FOR ATTENUATION 


Let us return to formula [147] and write $\bar{r}_{pQ}$ for the average
of all the $r_{pQ}$'s. Then we have

$$ab\,\bar{r}_{pQ} = \overset{ab}{\underset{1}{S}}\, r_{pQ}.$$

Similarly

$$(a^2 - a)\,\bar{r}_{pq} = \overset{(a^2 - a)}{\underset{1}{S}}\, r_{pq}$$

and

$$(b^2 - b)\,\bar{r}_{PQ} = \overset{(b^2 - b)}{\underset{1}{S}}\, r_{PQ}.$$

This gives

$$r_{(1+2+\cdots a)(I+II+\cdots b)} = \frac{ab\,\bar{r}_{pQ}}{\sqrt{a + (a^2 - a)\,\bar{r}_{pq}}\;\sqrt{b + (b^2 - b)\,\bar{r}_{PQ}}}$$
(Correlation between sums or averages of
equally weighted scores) ............[154]

If we make both a and b infinite, we obtain an estimate of the
correlation between a true criterion and a true test score, which
Spearman calls the value corrected for the attenuation in the
raw $r_{pQ}$ value due to chance errors. Let us designate the
scores which enter into the criterion as $X_1$, $X_3$, $X_5$, etc., and
those entering into the composite test score as $X_2$, $X_4$, $X_6$, etc.
Then from [154] we have

$$r_{(1+3+5+\cdots)(2+4+6+\cdots)} = r_{\infty\omega} = \frac{r_{12}}{\sqrt{r_{13}}\,\sqrt{r_{24}}}$$
(Correlation between a true criterion and true test score,
Spearman's formula for correction for attenuation) ...[155]

or in the previous notation where $r_{12}$ is the correlation between
two different measures, $r_{1I}$ the reliability coefficient of the first
measure, and $r_{2II}$ of the second, we have

$$r_{\infty\omega} = \frac{r_{12}}{\sqrt{r_{1I}}\,\sqrt{r_{2II}}}$$


The observations as to comparable tests apply equally to the 
securing of comparable criterion scores. In particular if the 
criteria are teachers’ judgments there may be high correlation 
between errors in judgments if teachers have discussed certain 
pupils with each other. 
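
The correction for attenuation, and the companion formula [153] for a fallible criterion and a true test score, are one-line computations. The sketch below is illustrative only: the function names and the numerical values are hypothetical, chosen so that the arithmetic is easy to follow.

```python
import math

def corrected_for_attenuation(r12, reliability_1, reliability_2):
    """Spearman's correction: r12 / (sqrt(r_1I) * sqrt(r_2II))."""
    return r12 / (math.sqrt(reliability_1) * math.sqrt(reliability_2))

def criterion_with_true_score(r01, reliability_test):
    """Formula [153]: fallible criterion with the true test score."""
    return r01 / math.sqrt(reliability_test)

# Hypothetical raw correlation .40 between measures with
# reliability coefficients .64 and .81.
estimate = corrected_for_attenuation(0.40, 0.64, 0.81)
print(round(estimate, 4))

# Hypothetical test correlating .30 with a criterion, reliability .36.
print(round(criterion_with_true_score(0.30, 0.36), 4))
```

Because the reliability coefficients enter as divisors under a radical, a reliability coefficient that is too small (as when the two forms are not truly comparable) inflates the corrected value.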


Section 57. RELIABILITY OF AVERAGES 


Formula [147] for the correlation between sums enables us to 
determine the reliability of the sum or average of a number 
of similar tests, knowing the reliability of a single test. If 
the tests are similar, we may call the successive tests different 
forms of the same test. Then the standard deviations are 
equal; if a straight average is taken all weights equal one;
and further, if the forms in the $\overset{a}{\underset{1}{S}}$ average are similar to those
in the $\overset{b}{\underset{1}{S}}$ average, then every $r_{pQ}$ = every $r_{pq}$ = every $r_{PQ}$
= $r_{1I}$, the correlation between one form and a second similar
one. Let $r_{af,Bf}$ be an abridged notation for $r_{(\overset{a}{\underset{1}{S}} X_p)(\overset{b}{\underset{1}{S}} X_P)}$; that is,
for the situation which holds when the scores in both of the
summations are upon similar tests or forms. This is the correlation
between the average or sum of a forms and the average
or sum of b others. It is given by

$$r_{af,Bf} = \frac{ab\, r_{1I}}{\sqrt{a + (a^2 - a)\, r_{1I}}\;\sqrt{b + (b^2 - b)\, r_{1I}}}$$
(Correlation between the average score
upon a forms and the average upon
b others) ........[156]

If a equals b we have:

$$r_{af,Af} = \frac{a\, r_{1I}}{1 + (a - 1)\, r_{1I}}$$
(The correlation between the average
score on a forms of a test and a
other similar forms) ........[157]
This formula given by Brown (1911) has frequently been 
called “Brown’s formula.” It is, however, but a special 


case of Spearman's earlier formula [147]. If but a single
form of a test is available it may be possible to divide it into
two comparable halves; for example, one half composed of the
odd and the other half composed of the even exercises, and
calculate the reliability coefficient of the half form, $r_{\frac{1}{2}f,\frac{I}{II}f}$, or
more simply, $r_{\frac{1}{2}\frac{I}{II}}$, and then by formula [157] obtain the reliability
coefficient of the single test.

$$r_{1I} = \frac{2\, r_{\frac{1}{2}\frac{I}{II}}}{1 + r_{\frac{1}{2}\frac{I}{II}}}$$
(Reliability of a test determined
from the scores on the two
halves) ........[158]


A second use to which formula [157] may be put is in the determination
of the number of forms required to secure a desired
or essential reliability coefficient. Solving for a we obtain

$$a = \frac{r_{af,Af}\,(1 - r_{1I})}{r_{1I}\,(1 - r_{af,Af})}$$
(Number of forms required to secure
a given reliability $r_{af,Af}$) ..[159]


The use of this equation frequently enables one to determine
whether it is worth while to attempt to improve a correlation
with a criterion by increasing the length of the test. If we
have a problem requiring a correlation of not less than .90 with
a certain criterion, and not permitting a test program extending
over more than two hours, and if we find experimentally that
the reliability of a certain 10 minute test is .20 we may determine
whether it is of any use continuing with this test. The
test cannot, except as a matter of chance, correlate with any
criterion to a greater extent than it correlates with a "true"
score of the particular function which it measures. Thus if
the criterion is the true score in formula [153] then $r_{0\omega}$ becomes
$r_{1\omega}$ and $r_{01}$ becomes $r_{1I}$, so that we have

$$r_{1\omega} = \sqrt{r_{1I}}$$
(Correlation between one form of a test and a true
score of the function measured by the test) ......[160]

Thus in our present problem $.90 = \sqrt{r_{af,Af}}$, or $r_{af,Af} = .81$.
That is, even if the criterion is no different in its essential
nature from that which is measured by the test, it is still
necessary to have a test with a reliability of .81 in order to
obtain a correlation of .90 with the criterion. Using formula
[159] we have

$$a = \frac{.81\,(1 - .20)}{.20\,(1 - .81)} = 17$$




Thus a test at least seventeen times as long as the one with 
reliability .20 is needed. This would require 170 minutes 
testing time, which according to conditions laid down is out 
of the question, so that there is no use continuing with this 
particular test. This very practical answer is obtained with- 
out any knowledge of the criterion or of the test correlation 
with the criterion. 
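
The worked example can be reproduced in a few lines. The sketch below (function names are illustrative assumptions) implements formula [157] for the reliability of a lengthened test, formula [158] for a test split into halves, and formula [159] for the number of forms required, and applies the last to the 10 minute test of reliability .20.

```python
def reliability_of_sum(a, r_single):
    """Formula [157]: reliability of the sum or average of a similar forms."""
    return a * r_single / (1 + (a - 1) * r_single)

def reliability_from_halves(r_half):
    """Formula [158]: reliability of the whole test from the two halves."""
    return 2 * r_half / (1 + r_half)

def forms_required(target, r_single):
    """Formula [159]: number of forms needed to reach a target reliability."""
    return target * (1 - r_single) / (r_single * (1 - target))

# A correlation of .90 with a true score requires reliability .90**2 = .81.
a = forms_required(0.81, 0.20)
print(round(a, 2), "forms, or about", round(a * 10), "minutes of testing")

# Lengthening the test by that factor does give the target reliability.
print(round(reliability_of_sum(a, 0.20), 4))
```

Formulas [157] and [159] are inverses of one another, so substituting the computed number of forms back into [157] returns the target reliability of .81.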

Formula [152] aids in determining the fitness of a test for a 
given purpose. Let us suppose that we have three 10 minute 
tests, the first with reliability .80, the second with reliability .40, 
and that these two correlate with a criterion to the extent of 
.30, and that the third test has a reliability of .20 correlating 
with the criterion to the extent of .24. How much will these 
correlations be raised by lengthening and thereby making 
the tests more reliable? Using formula [152] we obtain the 
accompanying table. 


TABLE XXXIX

CORRELATION OF SCORES OF TESTS OF
DIFFERENT LENGTHS WITH THE CRITERION

                    LENGTH OF TEST   TEST X             TEST Y             TEST Z
                    IN MINUTES       [Reliability .8]   [Reliability .4]   [Reliability .2]
1/4 of test              2.5         .24                .18                .13
1/2 of test              5           .27                .24                .17
Single test             10           .30                .30                .24
Sum of 2 tests          20           .32                .36                .30
Sum of 3 tests          30           .32                .39                .34
Sum of 5 tests          50           .33                .42                .39
Sum of 10 tests        100           .33                .44                .44
Sum of 20 tests        200           .33                .46                .48
Sum of ∞ tests                       .34                .47                .52
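
The entries for the first two tests (reliabilities .80 and .40, each correlating .30 with the criterion) can be recomputed from formula [152], taking a as the length of the test in units of the single 10 minute form, so that a may be fractional or infinite. The function name below is an assumption made for illustration.

```python
import math

def r_with_criterion(a, r_single, reliability):
    """Formula [152]: criterion correlation for a test a times as long.

    a may be fractional (a shortened test) or math.inf (the true score).
    """
    if math.isinf(a):
        return r_single / math.sqrt(reliability)
    return r_single / math.sqrt(1 / a + (a - 1) / a * reliability)

lengths = (0.25, 0.5, 1, 2, 3, 5, 10, 20, math.inf)
for a in lengths:
    x = r_with_criterion(a, 0.30, 0.80)   # first test
    y = r_with_criterion(a, 0.30, 0.40)   # second test
    print(f"a = {a:>5}   X: {x:.2f}   Y: {y:.2f}")
```

The more reliable test gains little from lengthening (its limit is about .34), while the less reliable test of equal initial validity rises to about .47, which is the point of the comparison.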


From this table it is apparent that the relative excellence of a 
test in comparison with others is a matter of reliability, cor- 
relation with the criterion, and possibility of increasing or 
decreasing the length of the test without changing its essential 
nature. If the three tests can be lengthened or shortened 
without changing their essential nature then 2.5 or 5 minutes 
testing with test X would yield a higher correlation with the
criterion than the same amount of time with either test Y or
Z. Thus if the testing time is less than ten minutes test X is
the most valuable. If the testing time lies between ten minutes
and 100 minutes test Y is the most valuable, and if the
testing time is over 100 minutes test Z is the most significant.
The principle here involved may frequently be used in making
the original selection of one or more tests and before correlation
with a criterion is known. If the testing time is of necessity
brief, give prime consideration to reliability of test; and
if the testing time is long, give prime consideration to "validity,"
to use a term recently employed in psychological literature,
i.e., to the accuracy and detail with which the test parallels the
criterion function, and but secondary attention to the reliability
of the test. If the reliability of the criterion is known the
correlations of the tests with a true criterion may be obtained
from the coefficients in Table XXXIX by dividing each by
the square root of the reliability coefficient of the criterion.
The resulting table will show even more strikingly than does
Table XXXIX the relative merits of the three tests.


Section 58. THE PROBABLE ERROR OF A COEFFICIENT 
CORRECTED FOR ATTENUATION 


The student should carefully note that the coefficient of 
correlation obtained by the use of the Spearman formula for 
correction for attenuation should never be used for the estima- 
tion of one actual measure from a second. This “‘corrected”’ 
coefficient is a promise of the correlation that one might expect 
to find between the variables if one had perfectly reliable 
measures. To use this corrected coefficient in a regression 
equation would lead to a less close fit of the regression line and 
to a larger standard error of estimate of the criterion, knowing 
the independent variable, than occurs when the “raw’’ cor- 
relation coefficient is used. The corrected coefficient of cor- 
relation is mainly of value in theoretical discussions and in 
serving this purpose its divergence from 1.00 is usually material. 
The derivation of a formula for the standard error of a cor- 
rected coefficient is as follows, in which the subscripts have 
the meanings stated at the beginning of Section 56. 

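$$r_{\infty\omega} = \frac{r_{12}}{\sqrt{r_{13}}\,\sqrt{r_{24}}}$$

Taking logarithmic differentials, we have

$$\frac{dr_{\infty\omega}}{r_{\infty\omega}} = \frac{dr_{12}}{r_{12}} - \frac{dr_{13}}{2r_{13}} - \frac{dr_{24}}{2r_{24}}$$

Squaring, summing, dividing by N, we have

$$\frac{\sigma^2_{r_{\infty\omega}}}{r^2_{\infty\omega}} = \frac{\sigma^2_{r_{12}}}{r^2_{12}} + \frac{\sigma^2_{r_{13}}}{4r^2_{13}} + \frac{\sigma^2_{r_{24}}}{4r^2_{24}} - \frac{r_{r_{12}r_{13}}\,\sigma_{r_{12}}\sigma_{r_{13}}}{r_{12}\,r_{13}} - \frac{r_{r_{12}r_{24}}\,\sigma_{r_{12}}\sigma_{r_{24}}}{r_{12}\,r_{24}} + \frac{r_{r_{13}r_{24}}\,\sigma_{r_{13}}\sigma_{r_{24}}}{2\,r_{13}\,r_{24}}$$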


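We may obtain $r_{r_{12}r_{13}}\sigma_{r_{12}}\sigma_{r_{13}}$ and $r_{r_{12}r_{24}}\sigma_{r_{12}}\sigma_{r_{24}}$ by
formula [128], $r_{r_{13}r_{24}}\sigma_{r_{13}}\sigma_{r_{24}}$ by formula [120],
and all of the $\sigma_r$'s by formula [108 b]. Doing so,
collecting terms and simplifying yields,

$$\sigma_{r_{\infty\omega}} = \frac{r_{\infty\omega}}{\sqrt{N}}\left[r^2_{\infty\omega} + \frac{1}{r^2_{12}} + \left(\frac{1}{4r^2_{13}} - \frac{r^2_{13}}{4} + r_{13} - \frac{1}{r_{13}} - 1\right) + \left(\frac{1}{4r^2_{24}} - \frac{r^2_{24}}{4} + r_{24} - \frac{1}{r_{24}} - 1\right)\right]^{\frac{1}{2}}$$

In the notation of this chapter this is

$$\sigma_{r_{\infty\omega}} = \frac{r_{\infty\omega}}{\sqrt{N}}\left[r^2_{\infty\omega} + \frac{1}{r^2_{12}} + \left(\frac{1}{4r^2_{1I}} - \frac{r^2_{1I}}{4} + r_{1I} - \frac{1}{r_{1I}} - 1\right) + \left(\frac{1}{4r^2_{2II}} - \frac{r^2_{2II}}{4} + r_{2II} - \frac{1}{r_{2II}} - 1\right)\right]^{\frac{1}{2}}$$
(Standard error of a coefficient of correlation calculated by formula
[155]) ........[161]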


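If we let $A_{1I}$ stand for the first parentheses and $A_{2II}$ for the
second we have

$$\sigma_{r_{\infty\omega}} = \frac{r_{\infty\omega}}{\sqrt{N}}\left(r^2_{\infty\omega} + \frac{1}{r^2_{12}} + A_{1I} + A_{2II}\right)^{\frac{1}{2}}$$

The quantities $1/r^2$ and A are tabled for different values of r,
in Table XL.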
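
Where Table XL is not at hand the quantities may be computed directly. The sketch below assumes that formula [161] is read as $\sigma_{r_{\infty\omega}} = (r_{\infty\omega}/\sqrt{N})\,(r^2_{\infty\omega} + 1/r^2_{12} + A_{1I} + A_{2II})^{1/2}$ with $A = 1/(4r^2) - r^2/4 + r - 1/r - 1$, r being the reliability coefficient concerned. This reading is a reconstruction, the function names are illustrative, and the sample values are hypothetical. One check on the reading is that with perfectly reliable measures it reduces to the familiar $(1 - r^2)/\sqrt{N}$.

```python
import math

def a_term(reliability):
    """The parenthesis A for one measure (assumed form, see lead-in)."""
    r = reliability
    return 1 / (4 * r * r) - r * r / 4 + r - 1 / r - 1

def se_corrected(r12, reliability_1, reliability_2, n):
    """Standard error of a coefficient corrected for attenuation."""
    r_true = r12 / math.sqrt(reliability_1 * reliability_2)
    inside = r_true ** 2 + 1 / r12 ** 2 + a_term(reliability_1) + a_term(reliability_2)
    return r_true / math.sqrt(n) * math.sqrt(inside)

# Hypothetical case: raw r = .50, reliabilities .80 and .70, N = 100.
print(se_corrected(0.50, 0.80, 0.70, 100))

# With perfect reliabilities the result equals (1 - r^2) / sqrt(N).
print(se_corrected(0.50, 1.0, 1.0, 100), (1 - 0.50 ** 2) / math.sqrt(100))
```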

When the corrected coefficient of correlation is calculated by
formula [161 c], or by

$$r_{\infty\omega} = \frac{\bar{r}}{\sqrt{r_{13}\,r_{24}}} \qquad [161\ a]$$

in which $\bar{r} = (r_{12} + r_{14} + r_{32} + r_{34})/4$, the standard error of
$r_{\infty\omega}$ is smaller than given by [161]. Before calculating this
standard error let us note that $\bar{r}$ may be expeditiously obtained
by calculating the correlation between the sum of the two
tests in the first trait and the sum of the two in the second trait.

We have:

$$r_{(1+3)(2+4)} = \frac{\Sigma (x_1 + x_3)(x_2 + x_4)}{\sqrt{\Sigma (x_1 + x_3)^2}\,\sqrt{\Sigma (x_2 + x_4)^2}} = \frac{r_{12} + r_{14} + r_{32} + r_{34}}{4\sqrt{(1 + r_{13})/2}\,\sqrt{(1 + r_{24})/2}} = \frac{\bar{r}}{\sqrt{(1 + r_{13})/2}\,\sqrt{(1 + r_{24})/2}}$$

so that

$$\bar{r} = r_{(1+3)(2+4)}\,\sqrt{(1 + r_{13})/2}\,\sqrt{(1 + r_{24})/2} \qquad [161\ b]$$
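
The shortcut just described can be sketched as follows. This is a minimal illustration, assuming equal standard deviations for the four tests and reading [161 b] as r_bar = r_sums * sqrt((1 + r13)/2) * sqrt((1 + r24)/2) and [161 a] as r_bar / sqrt(r13 * r24); the function names are hypothetical.

```python
import math


def mean_cross_correlation(r_sums, r13, r24):
    """Average of r12, r14, r32, r34 recovered from the correlation
    between the two sums (x1 + x3) and (x2 + x4)."""
    return r_sums * math.sqrt((1 + r13) / 2) * math.sqrt((1 + r24) / 2)


def corrected_from_mean(r_bar, r13, r24):
    """Corrected coefficient from the mean cross-correlation and the
    two reliability coefficients r13 and r24."""
    return r_bar / math.sqrt(r13 * r24)


# Illustrative values only: four cross-correlations of .40,
# reliabilities .80 and .70.
r_sums = 0.40 / (math.sqrt(0.90) * math.sqrt(0.85))
r_bar = mean_cross_correlation(r_sums, 0.80, 0.70)
r_true = corrected_from_mean(r_bar, 0.80, 0.70)
```

Only one correlation (between the two sums) and the two reliability coefficients are needed, in place of four separate cross-correlations.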


Thus $\bar{r}$ may be easily obtained from a knowledge of the
reliability coefficients and of the correlation between the two sums.
Assuming that the arithmetic average is as reliable as the
geometric average, then $r_{\infty\omega}$ calculated by [161 a] has the
same reliability as $r_{\infty\omega}$ obtained from

$$r_{\infty\omega} = \frac{(r_{12}\,r_{14}\,r_{32}\,r_{34})^{1/4}}{\sqrt{r_{13}\,r_{24}}}$$

(Yule's form of Spearman's formula for correction for
attenuation) . . . . [161 c]


The standard error of $r_{\infty\omega}$ calculated by this formula may be
obtained in a manner very similar to that given in [161]. It is,
however, a lengthy procedure and will not be recorded here.
In brief it involves taking logarithmic differentials, squaring,
summing, dividing by N, substituting values as given by
formulas [108 b], [128] and [120], and collecting terms after assuming
that $r_{12} = r_{14} = r_{32} = r_{34} = \bar{r}$. The answer is

$$\sigma_{r_{\infty\omega}} = \frac{r_{\infty\omega}}{\sqrt{N}} \left( r_{\infty\omega}^2 + \frac{(1 + r_{13})(1 + r_{24})}{4\bar{r}^2} + \frac{1}{4r_{13}^2} + \frac{1}{4r_{24}^2} - \frac{1}{r_{13}} - \frac{1}{r_{24}} - \frac{1}{2} \right)^{1/2}$$

(Standard error of a coefficient of correlation calculated
by formula [161 a] or formula [161 c]) . . . . [161 d]


Magnitudes $1/r^2$ are given in Table XL. Study of this formula
shows that the error in the corrected coefficient is very
frequently not at all large, being in fact much smaller than given
by Spearman (1910). The disagreement between the derivation above
[161 d] and that given by Spearman (1910, equation 24, p. 294)
lies in the fact that Spearman, following Filon, to whom part
of the derivation is credited, used formula [128] throughout,
whereas formula [129] should at times have been used. The
realization that this standard error is smaller than previously
recognized should throw much new light upon the question of
the specific or general nature of intellectual functions.


TABLE XL

  r      1/r²        A     |   r     1/r²      A     |   r     1/r²      A
 .01   10000.      2389.   |  .36   7.716   -1.521   |  .71   1.984   -1.329
 .02    2500.       574.   |  .37   7.305   -1.541   |  .72   1.929   -1.316
 .03    1111.       243.   |  .38   6.925   -1.556   |  .73   1.877   -1.304
 .04     625.       130.   |  .39   6.575   -1.568   |  .74   1.826   -1.291
 .05     400.        79.   |  .40   6.250   -1.578   |  .75   1.778   -1.280
 .06   277.78      51.84   |  .41   5.949   -1.584   |  .76   1.731   -1.267
 .07   204.08      35.80   |  .42   5.669   -1.588   |  .77   1.687   -1.255
 .08   156.25      25.64   |  .43   5.408   -1.590   |  .78   1.644   -1.243
 .09   123.46      18.84   |  .44   5.165   -1.590   |  .79   1.602   -1.231
 .10   100.00      14.10   |  .45   4.938   -1.588   |  .80   1.563   -1.219
 .11   82.645      10.68   |  .46   4.726   -1.585   |  .81   1.524   -1.208
 .12   69.444       8.14   |  .47   4.527   -1.581   |  .82   1.487   -1.196
 .13   59.172       6.22   |  .48   4.340   -1.576   |  .83   1.452   -1.184
 .14   51.020       4.75   |  .49   4.165   -1.570   |  .84   1.417   -1.173
 .15   44.444       3.59   |  .50   4.000   -1.563   |  .85   1.384   -1.161
 .16   39.062      2.669   |  .51   3.845   -1.555   |  .86   1.352   -1.150
 .17   34.602      1.931   |  .52   3.698   -1.546   |  .87   1.321   -1.138
 .18   30.864      1.332   |  .53   3.560   -1.537   |  .88   1.291   -1.127
 .19   27.701       .843   |  .54   3.429   -1.527   |  .89   1.262   -1.116
 .20   25.000       .440   |  .55   3.306   -1.517   |  .90   1.235   -1.105
 .21   22.676       .106   |  .56   3.189   -1.507   |  .91   1.208   -1.094
 .22   20.661      -.172   |  .57   3.078   -1.496   |  .92   1.181   -1.083
 .23   18.904      -.405   |  .58   2.973   -1.485   |  .93   1.156   -1.072
 .24   17.361      -.601   |  .59   2.873   -1.474   |  .94   1.132   -1.062
 .25   16.000      -.766   |  .60   2.778   -1.462   |  .95   1.108   -1.051
 .26   14.793      -.905   |  .61   2.687   -1.450   |  .96   1.085   -1.041
 .27   13.717     -1.023   |  .62   2.601   -1.439   |  .97   1.063   -1.030
 .28   12.755     -1.122   |  .63   2.520   -1.427   |  .98   1.041   -1.020
 .29   11.891     -1.207   |  .64   2.441   -1.415   |  .99   1.020   -1.010
 .30   11.111     -1.278   |  .65   2.367   -1.402   | 1.00   1.000   -1.000
 .31   10.406     -1.338   |  .66   2.296   -1.390   |
 .32    9.766     -1.389   |  .67   2.228   -1.378   |
 .33    9.183     -1.432   |  .68   2.163   -1.366   |
 .34    8.651     -1.467   |  .69   2.100   -1.353   |
 .35    8.163     -1.497   |  .70   2.041   -1.341   |


With probable errors available there is no excuse for the
indiscriminate averaging of corrected coefficients having values
above and below 1.00, yielding possibly an average nearly
equal to one. If we have a corrected coefficient equal to .90
with probable error of .02, and a second equal to 1.10 with a
probable error of .02, we may conclude that neither coefficient
is a chance variation from 1.00, and further that the fundamental
hypotheses of similar tests, lack of correlation between
errors, etc., underlying the idea of a reliability coefficient,
must be absent in the case of the data yielding the corrected
coefficient 1.10. A corrected coefficient greater than 1.00 is
just as absurd as a “raw” coefficient greater than 1.00, and if
positively found, as for example, 1.10 ± .02, it demands a
reëxamining of hypotheses as truly as would the latter were
it found to be greater than 1.00. Only in case corrected
coefficients differ from 1.00 by such small amounts that the
value 1.00 is well within the likelihood of occurrence, judged
by the probable errors of the corrected coefficients, is it sound
to average several such corrected coefficients to secure a measure
of general tendency.


Section 59. ESTIMATES OF TRUE SCORES AND THE PROBABLE
ERRORS OF THESE ESTIMATES


Formula [153 a] has value for very practical reasons. For 
example, suppose we know that the reliability of foremen’s 
judgments of the expertness of mechanicians is .36, and sup- 
pose we have a trade test the score upon which correlates with 
the judgments of one foreman to the extent of .48, then, letting
the foreman's judgments equal $X_0$ and the trade test score
equal $X_1$ we have

$$r_{1\infty} = \frac{r_{01}}{\sqrt{r_{00}}} = \frac{.48}{\sqrt{.36}} = .80$$


Thus the correlation between a single test score and an average 
of the judgments of an infinite number of foremen would be .8. 
If the hiring of a mechanician is not so much for the purpose 
of satisfying a particular foreman as it is to secure expert 
workmen the correlation .80 is not only the one of theoretical 
importance, but is, in fact, the correct one to use in regression 
equations estimating expertness from trade test score. We 
would have, letting $x_\infty$ = the foreman's true judgment of
expertness and $\bar{x}_\infty$ the best estimate of it,

$$\bar{x}_\infty = r_{1\infty}\,\frac{\sigma_\infty}{\sigma_1}\,x_1$$

(Regression of a true criterion upon a fallible score) . . . . [162]
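
The foreman example can be reproduced in a few lines. This is a minimal sketch, assuming the correction reads r_1inf = r_01 / sqrt(r_00) and formula [162] reads estimate = r_1inf * (sigma_inf / sigma_1) * x_1 in deviation units; the function names and the standard deviations in the last line are hypothetical.

```python
import math


def correlation_with_true_criterion(r_test_criterion, criterion_reliability):
    """Correlation of a test with the true (infinitely averaged) criterion."""
    return r_test_criterion / math.sqrt(criterion_reliability)


def estimated_true_deviation(x1, r_1_inf, sigma_inf, sigma_1):
    """Best estimate of the true criterion deviation from a test deviation x1."""
    return r_1_inf * sigma_inf / sigma_1 * x1


# Values from the text: r = .48 with one foreman, reliability .36.
r_1_inf = correlation_with_true_criterion(0.48, 0.36)

# Hypothetical standard deviations, for illustration only.
estimate = estimated_true_deviation(5.0, r_1_inf, sigma_inf=6.0, sigma_1=10.0)
```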


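The correlation $r_{1\infty}$ is given above and $\sigma_\infty$ is immediately
available, for we have, letting subscripts here indicate scores
on successive comparable trade tests,

$$\sigma_a^2 = \frac{\Sigma (x_1 + x_2 + \cdots + x_a)^2}{N} = \sum_{p=1}^{a} \sigma_p^2 + \sum_{p \neq q} r_{pq}\,\sigma_p\,\sigma_q \qquad [163]$$

in which the second sum contains $a^2 - a$ terms.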


And if the σ's are equal and r stands for the average of all the
inter-correlations between the tests this reduces to

$$\sigma_a = \sigma\sqrt{a + (a^2 - a)r}$$

(Standard deviation of the sums of a comparable tests) . . . . [164]

or, dividing by a and now letting $\sigma_a$ stand for the standard
deviation of the average of a such tests we have,

$$\sigma_a = \frac{\sigma}{a}\sqrt{a + (a^2 - a)r}$$

(Standard deviation of the averages of a comparable tests) . . [165]

And finally if a approaches infinity

$$\sigma_\infty = \sigma\sqrt{r}$$

(Standard deviation of the averages of an
infinite number of comparable tests) . . [166]
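
Formulas [164] to [166] can be evaluated directly. The sketch below assumes equal standard deviations and a common average intercorrelation r, as the text does; the function names are illustrative.

```python
import math


def sd_of_sum(sigma, a, r):
    """Standard deviation of the sum of a comparable tests, formula [164]."""
    return sigma * math.sqrt(a + (a * a - a) * r)


def sd_of_average(sigma, a, r):
    """Standard deviation of the average of a comparable tests, formula [165]."""
    return sd_of_sum(sigma, a, r) / a


def sd_of_true_scores(sigma, r):
    """Limit of formula [165] as a grows without bound, formula [166]."""
    return sigma * math.sqrt(r)


# Illustrative values: sigma = 10 and average intercorrelation .64.
for a in (1, 2, 10, 1000):
    print(a, round(sd_of_average(10.0, a, 0.64), 3))
print("limit", sd_of_true_scores(10.0, 0.64))
```

The printed values fall from 10 toward the limiting value 8, which shows how the dispersion of averages shrinks toward the dispersion of true scores as more comparable tests are combined.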
Since $\sigma_\infty < \sigma$, the standard deviation of the true ability of a
group is less than the standard deviation of the group upon a 
single fallible measurement. Accordingly measures of dis- 
persion based upon single tests are too great to represent the 
true distribution. Estimates of true dispersion are given by 
formula [166]. As is obvious from the derivation, o and r in 
the right hand member should be determined from the same 
population, or at least from two populations which one would 
expect to be equally homogeneous. I have elsewhere (Kelley,
1919 meas.), used formula [166] in the process of obtaining a
measure of true overlapping in ability of two groups. 
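Returning to formula [162] we obtain

$$\bar{x}_\infty = r_{1\infty}\,\frac{\sigma_\infty}{\sigma_1}\,x_1 = \frac{r_{01}}{\sqrt{r_{00}}} \cdot \frac{\sigma_0\sqrt{r_{00}}}{\sigma_1}\,x_1 = r_{01}\,\frac{\sigma_0}{\sigma_1}\,x_1$$

(Regression of a true criterion upon a fallible score) . . . . [162 a]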


The reader will of course notice that the right-hand member of 
this equation is the same as that of formula [91 a] which gives 
the regression of a fallible criterion upon a fallible score. We 
thus have, 


$$\bar{x}_\infty = b_{01}\,x_1$$

(Regression of a true criterion upon a fallible score) . . . . [162 b]

and

$$\bar{x}_0 = b_{01}\,x_1$$

(Regression of a fallible criterion upon a fallible score) . . . . [91 a]
or the estimated true score is the same as the estimated single 
score. This is, of course, as it should be, and further it leads 
to the interesting fact that the standard errors of estimate in 
the two cases are different. We have 


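$$\sigma_{0\cdot1} = \sigma_0 k_{01} = \sigma_0\sqrt{1 - r_{01}^2}$$

(Standard error of estimate of a fallible criterion
by means of a fallible score) . . . . [86]

$$\sigma_{\infty\cdot1} = \sigma_\infty k_{\infty1} = \sigma_0\sqrt{r_{00}}\,\sqrt{1 - \frac{r_{01}^2}{r_{00}}} = \sigma_0\sqrt{r_{00} - r_{01}^2}$$

(Standard error of estimate of a true criterion
by means of a fallible score) . . . . [167]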
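
To make the contrast between [86] and [167] concrete, the two standard errors can be computed side by side. The sketch assumes the readings sigma_0 * sqrt(1 - r01^2) and sigma_0 * sqrt(r00 - r01^2); the numbers reuse the foreman example (r01 = .48, r00 = .36) with a hypothetical criterion standard deviation of 1.

```python
import math


def se_fallible_criterion(sigma_0, r01):
    """Standard error of estimate of a fallible criterion, formula [86]."""
    return sigma_0 * math.sqrt(1 - r01**2)


def se_true_criterion(sigma_0, r01, r00):
    """Standard error of estimate of the true criterion, formula [167]."""
    return sigma_0 * math.sqrt(r00 - r01**2)


se_fallible = se_fallible_criterion(1.0, 0.48)
se_true = se_true_criterion(1.0, 0.48, 0.36)
print(round(se_fallible, 4), round(se_true, 4))
```

Both errors are expressed in units of the fallible criterion's standard deviation; the second is smaller whenever the criterion reliability r00 is below 1.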
Thus we are able to estimate the true criterion score with 
smaller error than the fallible criterion. This is very satis- 
fying. It means that in general, trade tests, intelligence tests, 
etc., actually accomplish a more accurate classification of those 
examined than indicated by the correlation with the criterion, 
since the criteria used are regularly fallible. The reliability
coefficient $r_{00}$ is of necessity greater than $r_{01}^2$, but with excellent
tests and poor criteria it may not be very much greater, so that
errors of estimate in placement may be small, and in fact much 
smaller than usually conceived. As a practical consequence 
it is seen that a systematic error in a criterion is very vicious, 
but that the chance error has no consequence whatever except 
in the requiring of a larger population in order to establish 
results with equal certainty. 


Section 60. ACCURACY OF PLACEMENT ON BASIS OF A SINGLE
SCORE


If in formula [162 a] we make $X_0$ the average of many
such scores as $X_1$, we have

$$\bar{x}_\infty = r_{1I}\,x_1$$

or

$$\bar{X}_\infty = r_{1I}\,X_1 + (1 - r_{1I})\,M_1$$

(Regression of a true score upon a fallible score
of the same function) . . . . [168]

The reason the correlation coefficient has replaced the regression
coefficient of equation [162 b] is because we are here dealing
with similar scores, implying equal standard deviations, so that
the regression coefficient equals $r_{1I}$. The accuracy of this
estimate of a true score is given by

$$\sigma_{\infty\cdot1} = \sigma_1\sqrt{r_{1I} - r_{1I}^2}$$

(Standard error of estimate of a true score by means of
a single score on the same function) . . . . [169]


This formula is very valuable as it enables a judgment as to 
the accuracy of placement. Let us be given an elementary 
school reading test, having a reliability coefficient of .8 and a 
standard deviation of 10 on test scores covering the same range 
of talent as that from which the reliability coefficient was deter- 
mined. If the sixth grade norm, or average score, equals 30, 
the seventh grade norm 38, and the eighth grade norm 46, 
let us determine the standard error of placement of a pupil 
as classified on the basis of the test score. We will first
estimate the pupil's true score by formula [168]. The standard
error of the estimated true scores, $\bar{X}_\infty$, is given by formula
[169].

$$\sigma_{\infty\cdot1} = 10\sqrt{.80 - .64} = 4.0$$

The standard error of placement of the child is 4.0 and the
probable error of placement 2.7, or 34 per cent of the difference
between grade means. The question raised and answered has
not involved a criterion outside of the test itself. With refer- 
ence to that capacity which is measured by the test, we can 
say that the error of classification is a certain percentage of 
the difference between norms; or, if the difference between 
grade norms is called a year’s growth, a certain percentage of 
a year’s growth. Much may thus be determined without a 
criterion and this procedure is generally to be preferred to 
dependence upon a criterion having a systematic error, such, 
for example, as would be the case were a teacher to systemati- 
cally judge pulchritude, vivacity, or mere industry, as evidence 
of reading ability. In addition to the simplicity of the method 
just described it may be recommended from the standpoint of 
reliability. The standard deviation of estimated true scores
(estimated by means of the regression equation) is $\sigma_{\infty\cdot1}$, and
the standard deviation of test scores is $\sigma_1$. Accordingly $\sigma_{\infty\cdot1}/\sigma_1$
is a measure of the proportionate reduction of error in the
placement of an individual having a given test score, over
random placement. The smaller this ratio the greater the
reduction. This quantity has a very small probable error as
will immediately be shown so that the proportionate improvement
due to the use of a test can be very accurately determined.
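
The reading-test illustration can be worked numerically. The sketch below assumes formula [168] reads estimate = r * X + (1 - r) * M and formula [169] reads sigma_1 * sqrt(r - r^2), and takes the probable error as .6745 times the standard error. The pupil's score of 46 and the group mean of 38 are hypothetical choices made only to show the regression step.

```python
import math


def estimated_true_score(score, group_mean, reliability):
    """Regression estimate of a true score from one fallible score, [168]."""
    return reliability * score + (1 - reliability) * group_mean


def se_estimated_true_score(sd, reliability):
    """Standard error of the estimated true score, [169]."""
    return sd * math.sqrt(reliability - reliability**2)


# Values from the text: reliability .8, standard deviation 10,
# grade norms 8 points apart.
se = se_estimated_true_score(10.0, 0.8)
probable_error = 0.6745 * se
fraction_of_grade_step = probable_error / 8.0

# Hypothetical pupil: score 46 in a group whose mean is 38.
estimate = estimated_true_score(46.0, 38.0, 0.8)
print(round(se, 2), round(probable_error, 2), round(fraction_of_grade_step, 2))
```

The ratio se / 10 is the measure of improvement over random placement discussed in the text; here it is .4.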

Let $\sigma_{\infty\cdot1}/\sigma_1 = i$ = the measure of improvement due to the
use of the test. Noting that the correlation between r and
r² equals 1.0 we have

$$i^2 = r - r^2 = r(1 - r)$$

taking logarithmic differentials,

$$\frac{2\,di}{i} = \frac{dr}{r} - \frac{dr}{1 - r}$$

Squaring, summing, and dividing by N,

$$\frac{4\sigma_i^2}{i^2} = \frac{\sigma_r^2}{r^2} + \frac{\sigma_r^2}{(1 - r)^2} - \frac{2\sigma_r^2}{r(1 - r)}$$

$$\frac{\sigma_i^2}{i^2} = \frac{(1 - 2r)^2}{4r^2(1 - r)^2}\,\sigma_r^2$$

$$\sigma_i = \frac{i\,(1 + r)\,|1 - 2r|}{2r\sqrt{N}}$$

(Standard error of the measure of improvement, over
random classification, resulting from the use of a score
of reliability r [= $r_{1I}$]) . . . . [170]


Note that if $r_{1I}$ = .5 this standard error becomes zero. In
the derivation of the formula second and higher powers of
errors have as usual been discarded. Their inclusion would
show that the standard error of this ratio is a trifle above
zero when $r_{1I}$ = .5. If the error in r is of the order .02 the
square is .0004, which is the order of the discarded portion,
so that no material error is introduced in the formula by the
omission of second and higher powers of the errors in r, if N
is greater than 25. In fact, for ordinary values of $r_{1I}$ we have a
remarkably small $\sigma_i$. We need not hesitate to place confidence
in an obtained value of i, even though the probable error of
the obtained r is rather disconcertingly large.
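
A short numerical sketch shows how small the standard error of i is beside that of r. It assumes formula [170] reads sigma_i = i (1 + r) |1 - 2r| / (2 r sqrt(N)) with i = sqrt(r - r^2), and uses the usual (1 - r^2)/sqrt(N) for the standard error of r; the function names are illustrative.

```python
import math


def improvement_ratio(r):
    """i = sigma_(inf.1) / sigma_1 for a score of reliability r."""
    return math.sqrt(r - r * r)


def se_improvement_ratio(r, n):
    """Standard error of i under the assumed reading of formula [170]."""
    return improvement_ratio(r) * (1 + r) * abs(1 - 2 * r) / (2 * r * math.sqrt(n))


def se_correlation(r, n):
    """First-order standard error of a correlation coefficient."""
    return (1 - r * r) / math.sqrt(n)


for r in (0.3, 0.5, 0.8, 0.9):
    print(r, round(se_improvement_ratio(r, 100), 4), round(se_correlation(r, 100), 4))
```

At r = .5 the first-order error of i vanishes, as the text notes, and at r = .8 with N = 100 it is .027 against .036 for r itself.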




Section 61. AVERAGE INTERCORRELATION 


The correlation $r_{1I}$ has occurred in several of the preceding
formulas. If but two series of comparable scores are available
this correlation may be calculated in but one way, but if there
are several comparable series or forms of a test, which have
been given, there are many ways of calculating the reliability
coefficient. Having five comparable series of measures $x_1$, $x_2$,
$x_3$, $x_4$, $x_5$ there are 10 possible pairings of series from which to
calculate a reliability coefficient. This would in itself be a
rather laborious task, but if the standard deviations of the
several series are equal, or approximately so, the average of
these 10 correlations may be calculated in a single operation
since formula [163] may be solved for r, giving

$$r_{1I} = \frac{\sigma_a^2/\sigma^2 - a}{a^2 - a}$$

(Average intercorrelation between a series, whose means, and
standard deviations, are equal) . . . . [171]


The magnitude a is the number of series combined, so that it
only remains to calculate $\sigma_a$ and σ. If scores for each
individual on the a forms are added, a series of N scores is obtained
whose standard deviation is $\sigma_a$. Further the (aN) separate
scores may all be entered into a single distribution and the
standard deviation, σ, calculated. Thus whenever the means
and the standard deviations of several series are equal, it is
practically as simple to calculate the average intercorrelation
as to determine a single correlation. It will now be shown that
when ranks instead of scores are involved the calculation of the
intercorrelation is still more simple. We need $\sigma_a^2$ and $\sigma^2$. It
has already been determined in Section 53 that if there are
N ranks, 1, 2, 3, . . . N, their mean equals (N + 1)/2 and
their standard deviation

$$\sigma = \sqrt{\frac{N^2 - 1}{12}}$$

Let S equal the sum of the a ranks for a given individual, then

$$\frac{\Sigma S}{N} = \frac{a(N + 1)}{2}$$

Accordingly

$$\sigma_a^2 = \frac{\Sigma S^2}{N} - \left[\frac{a(N + 1)}{2}\right]^2$$

and

$$\sigma^2 = \frac{N^2 - 1}{12}$$

Substituting the obtained values for $\sigma_a^2$ and $\sigma^2$, and simplifying
gives

$$r_{1I} = 1 - \frac{a(4N + 2)}{(a - 1)(N - 1)} + \frac{12\,\Sigma S^2}{a(a - 1)N(N^2 - 1)}$$

(Average intercorrelation between a series of N ranks) . . [172]


This formula may be illustrated by a problem drawn from the
writer's material. Six judges, K, T, U, B, L, H, rank
according to merit 12 answers to a given problem as follows:

                     Ranks Given by Judges

ANSWERS     K      T      U      B      L      H       S          S²
   A        1      5      7     10      2      5      30        900
   B        2.5    6      4      6      3      9      30.5      930.25
   C        2.5    3      1      4      1      2      13.5      182.25
   D        4      2      2     11      8      3      30        900
   E        5     12      3      1      4     10      35      1,225
   F        6      1      8      2      5      1      23        529
   G        7     11     10      8     12      4      52      2,704
   H        8      9      5      7      6     11      46      2,116
   I        9      4      9     12      7      6      47      2,209
   J       10      7     11      5      9      8      50      2,500
   K       11     10     12      9     10     12      64      4,096
   L       12      8      6      3     11      7      47      2,209
                                                             20,500.50

a = 6, N = 12, ΣS² = 20,500.50
therefore, by formula [172], $r_{1I}$ = +.3241


Such a problem as finding the average intercorrelation between 
the ranks of English compositions when 50 compositions are 
ranked by 100 judges would require the calculation of 4950 
correlation coefficients, if no short-cut were available. But 
by the method illustrated the work could be done after the 
tabulation sheet is available in the time that might be required 
for four or five coefficients of correlation. 
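
The short-cut can be checked on the six judges' rankings. The sketch assumes formula [172] reads r = 1 - a(4N + 2)/((a - 1)(N - 1)) + 12 ΣS²/(a(a - 1)N(N² - 1)). Two entries that are illegible in the source table (judge U's rank of answer E and judge B's rank of answer L) are taken as 3, the only values that complete each judge's set of ranks and agree with the printed S column.

```python
# Ranks of answers A..L by each judge, read from the table in the text.
ranks = {
    "K": [1, 2.5, 2.5, 4, 5, 6, 7, 8, 9, 10, 11, 12],
    "T": [5, 6, 3, 2, 12, 1, 11, 9, 4, 7, 10, 8],
    "U": [7, 4, 1, 2, 3, 8, 10, 5, 9, 11, 12, 6],
    "B": [10, 6, 4, 11, 1, 2, 8, 7, 12, 5, 9, 3],
    "L": [2, 3, 1, 8, 4, 5, 12, 6, 7, 9, 10, 11],
    "H": [5, 9, 2, 3, 10, 1, 4, 11, 6, 8, 12, 7],
}


def rank_sums(rank_lists):
    """Sum of the ranks given to each individual (the S column)."""
    return [sum(column) for column in zip(*rank_lists)]


def average_rank_intercorrelation(rank_lists):
    """Average intercorrelation of a rankings of N individuals, formula [172]."""
    a = len(rank_lists)
    n = len(rank_lists[0])
    sum_sq = sum(s * s for s in rank_sums(rank_lists))
    return (1 - a * (4 * n + 2) / ((a - 1) * (n - 1))
            + 12 * sum_sq / (a * (a - 1) * n * (n * n - 1)))


s_column = rank_sums(list(ranks.values()))
r_average = average_rank_intercorrelation(list(ranks.values()))
print(round(r_average, 4))
```

One pass over the tabulation sheet replaces the 15 separate correlations that six judges would otherwise require.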

Suppose for the data just given it is desired to find out who
is the best judge. The data are, of course, too scant to answer
the question but they will illustrate the method. We might
find correlations $r_{KS}$, $r_{TS}$, $r_{US}$, etc., and consider that judge the
best who agrees most closely with the composite ranking.
These correlations would enable a ranking of the judges, but
they would be spuriously high because the rank of the judge
himself is included in the S composite. We therefore desire
either $r_{K(S-K)}$, the correlation of the judge with the composite,
omitting himself, or $(r_{KT} + r_{KU} + \cdots + r_{KH})/5$, the average of all
the correlations of each judge with the others. If judgments
are expressed in the form of rankings, standard deviations are
equal. The formula derived below will apply not only when
ranks are used, but to any case in which standard deviations are
equal. Let σ = the common standard deviation of the
rankings. Let $r_{1S}$ represent the correlation between the ranking
of one judge and the sum of the rankings of all the judges,
including himself. Let $r_{1(S-1)}$ be the correlation between the
ranking of one judge and the sum of all the rankings of the
other judges. Let

$$\bar{r}_{1p} = \frac{r_{12} + r_{13} + \cdots + r_{1a}}{a - 1}$$

represent the average correlation between rankings of judge
(1) and the other judges, and let $\bar{r}_{pq}$ equal the average of all
the intercorrelations between the ranks of the judges. Then

$$\bar{r}_{1p} = \frac{1}{a - 1}\sum_{p} r_{1p}$$

where p takes all values from 1 to a except the value 1.

$$\bar{r}_{pq} = \frac{1}{a^2 - a}\sum_{p}\sum_{q} r_{pq}$$

where p takes all values from 1 to a, and q takes all values
except the value p.

$$r_{1S} = \frac{\Sigma x_1 (x_1 + x_2 + \cdots + x_a)}{N\sigma\sigma_S} = \frac{\sigma^2\left[1 + (a - 1)\bar{r}_{1p}\right]}{\sigma \cdot \sigma\sqrt{a + (a^2 - a)\bar{r}_{pq}}} = \frac{1 + (a - 1)\bar{r}_{1p}}{\sqrt{a + (a^2 - a)\bar{r}_{pq}}} \qquad [173]$$

Solving for $\bar{r}_{1p}$ we have

$$\bar{r}_{1p} = \frac{r_{1S}\sqrt{a + (a^2 - a)\bar{r}_{pq}} - 1}{a - 1}$$

(Mean correlation between one series and (a − 1)
others, in case standard deviations are equal) . . . . [174]


The requirement that means shall be equal is necessary in case
formula [171] is used for the calculation of $\bar{r}_{pq}$. The notation
$r_{1I}$ was used upon the assumption that the several series were
similar, but note that $r_{1I}$ of formula [171] and $\bar{r}_{pq}$ in formula
[174] are identical in derivation. The average intercorrelation
$\bar{r}_{pq}$ is to be calculated once for all by formula [171] or [172]
and $r_{1S}$ calculated by the ordinary formulas [90], [93], [94],
[95] or [142] for each successive series.

$$r_{1(S-1)} = \frac{\Sigma x_1 (x_1 + x_2 + \cdots + x_a - x_1)}{N\sigma^2\sqrt{a + (a^2 - a)\bar{r}_{pq} - 1 - 2(a - 1)\bar{r}_{1p}}} = \frac{(a - 1)\bar{r}_{1p}}{\sqrt{(a - 1) + (a^2 - a)\bar{r}_{pq} - 2(a - 1)\bar{r}_{1p}}}$$

(Correlation between one series and the composite of (a − 1)
others in case standard deviations are equal) . . . . [175]

Formula [175] involves $\bar{r}_{1p}$ which is already given by formula
[174]. Substituting we obtain

$$r_{1(S-1)} = \frac{-\left(1 - r_{1S}\sqrt{a + (a^2 - a)\bar{r}_{pq}}\right)}{\sqrt{-1 + a + (a^2 - a)\bar{r}_{pq} + 2\left(1 - r_{1S}\sqrt{a + (a^2 - a)\bar{r}_{pq}}\right)}}$$

(Correlation between one series and a sum or average of
(a − 1) others if standard deviations are equal) . . . . [176]


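To illustrate these formulas we may study the rankings of the
six judges K, T, U, B, L, H to answer the question: which
judge agrees most closely with the composite rankings of the
others? We have

$$\sigma_K = \sqrt{\frac{N^2 - 1}{12}} = \sqrt{\frac{143}{12}} = 3.4521$$

$$\sigma_S = \sqrt{\frac{\Sigma S^2}{N} - \left[\frac{a(N + 1)}{2}\right]^2} = 13.6885$$

$$\Sigma x_K S = (30 + 76.25 + \cdots) - 12\left[\frac{12 + 1}{2}\right]\left[\frac{6(12 + 1)}{2}\right] = 454.00$$

$$r_{KS} = \frac{\Sigma x_K S}{N\sigma_K\sigma_S} = .8006$$

A similar determination of the other correlations gives the
table

$r_{KS}$ = .8006     $r_{BS}$ = .3086
$r_{TS}$ = .6604     $r_{LS}$ = .8006
$r_{US}$ = .7504     $r_{HS}$ = .6437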

These coefficients establish the order of agreement of each
judge with the others, but they are spuriously high in that S
includes the record of each judge himself. We will, therefore,
knowing by previous calculation $\bar{r}_{pq}$ = .3241, use formula
[176] to calculate $r_{K(S-K)}$ and other similar coefficients. We
obtain

$r_{K(S-K)}$ = .6752     $r_{B(S-B)}$ = .0592
$r_{T(S-T)}$ = .4777     $r_{L(S-L)}$ = .6752

These correlations may be taken at their face value. It is 
seen that judges K and L agree most highly with the other 
judges, while judge B agrees scarcely at all with the average 
opinion of the others. 
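
The removal of the spurious element can be sketched numerically. The code assumes formulas [174] and [176] read as (r_1S * sqrt(D) - 1)/(a - 1) and (r_1S * sqrt(D) - 1)/sqrt(a + 1 + (a² - a) r_pq - 2 r_1S sqrt(D)), where D = a + (a² - a) r_pq; the second is an algebraic rearrangement of the printed form. Function names are illustrative.

```python
import math


def mean_correlation_with_others(r_1s, a, r_pq):
    """Average correlation of one series with the (a - 1) others, [174]."""
    d = a + (a * a - a) * r_pq
    return (r_1s * math.sqrt(d) - 1) / (a - 1)


def correlation_with_rest(r_1s, a, r_pq):
    """Correlation of one series with the sum of the others, [176]."""
    d = a + (a * a - a) * r_pq
    excess = r_1s * math.sqrt(d) - 1
    return excess / math.sqrt(a - 1 + (a * a - a) * r_pq - 2 * excess)


# Correlations with the full composite S, as given in the text.
with_composite = {"K": 0.8006, "T": 0.6604, "B": 0.3086, "L": 0.8006}
corrected = {judge: correlation_with_rest(r, 6, 0.3241)
             for judge, r in with_composite.items()}
for judge, value in corrected.items():
    print(judge, round(value, 4))
```

Because the inputs are rounded to four places, the results agree with the printed coefficients only to about the fourth decimal.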


Section 62. THE EFFECT OF DIFFERENT RANGES UPON
CORRELATION OF SIMILAR MEASURES


I have elsewhere pointed out (Kelley, 1921 rel.) that a 
coefficient of correlation should be interpreted in the light of 
the ranges of the traits measured. This is true of all correla- 
tions, but it may be most readily proven when dealing with 
reliability coefficients. To quote from the reference cited, 
making such slight changes as are necessary to conform to 
the present notation: 

“The reliability coefficient is, however, not an entirely
satisfactory measure of reliability, for it is affected by the
distribution, in the trait measured, of the particular group studied.
To secure a reliability coefficient of .40 from a group composed
of children in a single grade is probably indicative of greater,
not less, reliability than to secure a reliability coefficient of
.90 from a group composed of children from the second to
twelfth grades. If it is reasonable to assume that in terms of
true ability the spread of talent is four times as great in the
eleven grades as in a single grade, the correlation in the second
case would need to be .914 in order to indicate as close a
relationship as that shown by a reliability coefficient of .40 in the
single grade. The following formula gives the relationship:

$$\frac{\sigma_\infty}{\Sigma_\infty} = \frac{\sqrt{r_{1I}(1 - R_{1I})}}{\sqrt{R_{1I}(1 - r_{1I})}}$$

(Relation between ranges in true ability and
reliability coefficients) . . . . [177] *

$\sigma_\infty$ and $\Sigma_\infty$ are the standard deviations of the two groups in
terms of true ability, and $r_{1I}$ and $R_{1I}$ are the reliability
coefficients of the two groups. Solving this equation for the case
in which $\Sigma_\infty = 4\sigma_\infty$, and $r_{1I}$ = .40, gives $R_{1I}$ = .914.

“If the standard deviations of scores in two groups are known,
it is not necessary to make any assumption; for then the
following formula applies:

$$\frac{\sigma}{\Sigma} = \frac{\sqrt{1 - R_{1I}}}{\sqrt{1 - r_{1I}}}$$

(Relation between ranges in obtained scores and
reliability coefficients) . . . . [178]

In this formula σ and Σ are the standard deviations of the
scores in the two groups and $r_{1I}$ and $R_{1I}$ the reliability
coefficients respectively. In passing, it may be noted that this
equation is an excellent criterion for determining whether a
test is equally effective in a range Σ as in another range σ;
for, if the relationship just given does not hold within the
probable error of the determination, it is evidence that higher
correlation is found in one part of the range than in another.”
The proof of the above formulas is simple. Let $\sigma_{1\cdot\infty}$ = the
standard deviation of an array of single test scores corresponding
to a given true score for the one range of talent and $\Sigma_{1\cdot\infty}$
the standard deviation for the second range of talent. By
formula [86]

$$\sigma_{1\cdot\infty} = \sigma_1\sqrt{1 - r_{1\infty}^2}$$

but by formula [160], $r_{1\infty}^2 = r_{1I}$ so that,

$$\sigma_{1\cdot\infty} = \sigma_1\sqrt{1 - r_{1I}}$$

Similarly

$$\Sigma_{1\cdot\infty} = \Sigma_1\sqrt{1 - R_{1I}}$$

but if the test is equally as effective in one range as in the
other the standard deviations of the divergences of the single
scores from the true scores are equal, i.e.,

$$\sigma_{1\cdot\infty} = \Sigma_{1\cdot\infty}$$

so that

$$\frac{\sigma_1}{\Sigma_1} = \frac{\sqrt{1 - R_{1I}}}{\sqrt{1 - r_{1I}}}$$

(Relation between standard deviations and reliability
coefficients obtained from two different ranges when the measure is
equally reliable throughout the two ranges) . . . . [178]


* The validity of this equation is briefly discussed by Holzinger (1921). 


Formula [165] enables us to express the same relationship
dealing with true standard deviations instead of those obtained
from single tests. Substituting for $\sigma_1$ and $\Sigma_1$, we have

$$\frac{\sigma_\infty}{\Sigma_\infty} = \frac{\sqrt{r_{1I}(1 - R_{1I})}}{\sqrt{R_{1I}(1 - r_{1I})}}$$

(Relation between true measures of dispersion and reliability
coefficients obtained in two different ranges, when the measure
is equally reliable throughout the two ranges) . . . . [179]


The fact that correlation changes with range makes comparison 
between reliability coefficients difficult. If one worker reports 
a test as having a reliability coefficient of .40 and a second
reports a reliability coefficient of .90 for a test purporting to
measure the same function we are not warranted in concluding 
without further data that the second test is the more reliable. 
For this reason the reporting of standard errors of estimate of 
true scores is to be recommended, for these will not change with 
the range if the test is equally effective throughout the range. 
Knowing the standard errors of estimate we would still be 
unable to compare two tests, if there is no equating of the units 
of the one test in terms of the units of the other. If the first 
worker reports a standard error of estimate for his test of 10 
units, and the second a standard error of 2 units, and if some 
method of equating the scores (see Chapter VI) enables one 
to say that 6 units in the first test are equivalent in range 
covered to one unit in the second, then we can definitely say 
that the first test is the more reliable, for 10/6 < 2/1. More
extended discussion of this point is given in (Kelley, 1921 rel.).
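
The dependence of a reliability coefficient on range can be illustrated by solving formulas [178] and [179] for the coefficient in the second range. The sketch assumes the readings sigma/Sigma = sqrt((1 - R)/(1 - r)) for obtained scores and sigma_inf/Sigma_inf = sqrt(r(1 - R)/(R(1 - r))) for true scores; the function names are hypothetical.

```python
import math


def reliability_from_obtained_sd_ratio(r, sd_ratio):
    """Solve [178] for R, where sd_ratio = sigma / Sigma (obtained scores)."""
    return 1 - sd_ratio**2 * (1 - r)


def reliability_from_true_sd_ratio(r, true_sd_ratio):
    """Solve [179] for R, where true_sd_ratio = sigma_inf / Sigma_inf."""
    q = true_sd_ratio**2
    return r / (r + q * (1 - r))


# Case from the text: reliability .40 in a single grade, true spread
# four times as great in the wider group.
wide_reliability = reliability_from_true_sd_ratio(0.40, 0.25)
print(round(wide_reliability, 3))

# The ratio of obtained standard deviations implied by the same case.
obtained_ratio = math.sqrt((1 - wide_reliability) / (1 - 0.40))
```

A test that is equally effective throughout both ranges would therefore show .40 in the single grade and about .914 in the eleven grades, which is why the two coefficients cannot be compared directly.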


Section 63. THE EFFECT OF DIFFERENT RANGES UPON
CORRELATION OF DIFFERENT MEASURES


In case two different series of measures are correlated it is 
usually not known just what is the nature of the curtailment or 
extension of the ranges of the two series which has been brought 
about by some selective agency. In illustration; individuals 
of one race are probably less variable with reference to general 
intelligence and also less variable with reference to memory 
ability than humanity in general. But how much the decrease 
in variability is, or whether it is the same in the two functions 




is not known. The correlation between general intelligence 
and memory ability determined from a random sampling of 
one range would probably be smaller than the same correlation 
calculated from humanity in general, but a priori considerations 
would give but a poor estimate of how great the difference is. 
In such a case and without additional data a correction of the 
correlation as found in the one range to enable a comparison 
with a similar correlation as found in the second range is 
impossible. If, however, the nature of the curtailment is 
known and is upon the basis of one trait only we may derive a 
formula enabling a comparison of correlation coefficients ob- 
tained from different ranges. Note that one trait is arbitrarily 
curtailed (or extended) and that the other is affected only in a
consequential manner. Let x be the variable, the distribution
of which is curtailed, and let y be the other variable. In the
non-curtailed scatter diagram let us suppose the y arrays are
homoscedastic and show rectilinear regression. The dropping 
out of certain of these arrays, or of random parts of certain of 
them, will not change the slope of the regression line nor the 
homoscedasticity of the y-arrays, but it may be expected to 
change both the slope of the other regression line and the 
scedasticity of the x-arrays. Thus, designating the constants 
of the uncurtailed distribution by capital letters and of the 
curtailed by small letters, we have 


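$$\sigma_{2\cdot1} = \Sigma_{2\cdot1} \quad \text{and} \quad b_{21} = B_{21}$$ . . . . [180] and [181]

but

$$\sigma_{1\cdot2} \neq \Sigma_{1\cdot2}, \quad b_{12} \neq B_{12}, \quad \sigma_1 \neq \Sigma_1, \quad \sigma_2 \neq \Sigma_2$$

and

$$r_{12} \neq R_{12}$$

By formula [56] we have

$$\sigma_{2\cdot1} = \Sigma_{2\cdot1} = \sigma_2\sqrt{1 - r_{12}^2} = \Sigma_2\sqrt{1 - R_{12}^2}$$

or

$$\sigma_2 k_{12} = \Sigma_2 K_{12}$$

(Relation between correlations and y-standard
deviations when x-ranges have been changed) . . . . [182]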
Note that formula [178] is but a special case of [182] for by
letting the first variable be a true score and the second variable
a score upon a single test of the same function, formula [182]
becomes formula [178]. We may relate the y-standard deviations
to the x-standard deviations and obtain a relationship
between the correlation and the standard deviation of the
curtailed distribution. By formulas [87] and [180] we have

$$\Sigma_2^2 = \Sigma_{2\cdot1}^2 + \Sigma_{\bar{x}_2}^2 = \sigma_{2\cdot1}^2 + \Sigma_{\bar{x}_2}^2 \qquad [183]$$

also

$$\bar{x}_2 = b_{21}\,x_1 = r_{12}\,\frac{\sigma_2}{\sigma_1}\,x_1 \qquad [184]$$

Squaring, summing and dividing [184] by the population gives,
for the uncurtailed distribution,

$$\Sigma_{\bar{x}_2}^2 = r_{12}^2\,\sigma_2^2\,\frac{\Sigma_1^2}{\sigma_1^2}$$

Substituting in [183]

$$\Sigma_2^2 = \sigma_{2\cdot1}^2 + r_{12}^2\,\sigma_2^2\left(\frac{\Sigma_1}{\sigma_1}\right)^2 = \sigma_2^2\left[(1 - r_{12}^2) + r_{12}^2\left(\frac{\Sigma_1}{\sigma_1}\right)^2\right] \qquad [185]$$

Substituting this value of $\Sigma_2^2$ in formula [182], dividing by $\sigma_2^2$
and solving for $R_{12}$ yields

$$R_{12} = \frac{r_{12}\,\dfrac{\Sigma_1}{\sigma_1}}{\sqrt{1 - r_{12}^2 + r_{12}^2\,(\Sigma_1/\sigma_1)^2}} \qquad [186]$$

which is the result obtained by Pearson (1903, inf.).
This may be written in the form

$$\frac{R_{12}}{K_{12}\,\Sigma_1} = \frac{r_{12}}{k_{12}\,\sigma_1}$$

(Relation between correlations determined from ranges whose
standard deviations in the case of the curtailed measure are in
the ratio $\Sigma_1/\sigma_1$)


The only assumptions underlying this derivation have been 
rectilinearity and homoscedasticity in the curtailed trait. The 
standard error in R when thus determined is given in formula 
[300]. The accompanying table is presented to give a concrete 
idea of the differences in correlation that may be expected due 
to differences in range: 


TABLE XLI

 σ₁/Σ₁   IF r = .1   r = .2   r = .3   r = .4   r = .6   r = .8   r = .95
         THEN R =    R =      R =      R =      R =      R =      R =
  .75      .133       .263     .387     .503     .707     .872     .971
  .50      .197       .378     .532     .658     .832     .936     .987
  .25      .373       .632     .783     .868     .949     .983     .997
  .10      .709       .898     .953     .975     .991     .997     .9995
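
The entries of Table XLI can be regenerated from formula [186]. The sketch assumes [186] reads R = r k / sqrt(1 - r² + r² k²) with k = Σ₁/σ₁, the ratio of the uncurtailed to the curtailed standard deviation; the function name is illustrative.

```python
import math


def correlation_in_full_range(r, sd_ratio):
    """Correlation expected in the uncurtailed range, formula [186].

    r is the correlation in the curtailed range and sd_ratio is
    sigma_1 / Sigma_1 (curtailed over uncurtailed standard deviation).
    """
    k = 1 / sd_ratio
    return r * k / math.sqrt(1 - r * r + (r * k) ** 2)


for sd_ratio in (0.75, 0.50, 0.25, 0.10):
    row = [round(correlation_in_full_range(r, sd_ratio), 3)
           for r in (0.1, 0.2, 0.3, 0.4, 0.6, 0.8, 0.95)]
    print(sd_ratio, row)
```

When the ratio is 1 the function returns r unchanged, and the correction grows rapidly as the curtailed range becomes a smaller fraction of the full one.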




A situation in which the ratio of the standard deviations may 
be determined is when the curtailed distribution is a part of 
a normal distribution. We have already noted [181], that 


$$\frac{r_{12}\,\sigma_2}{\sigma_1} = \frac{R_{12}\,\Sigma_2}{\Sigma_1} = b_{21} = B_{21}$$


It is necessary to remember that the first variable x is the one 
upon the basis of which there has been a curtailment of
distribution; that is, whatever difference there may be between $\sigma_2$
and $\Sigma_2$ is consequential to an imposed difference in $\sigma_1$ and $\Sigma_1$.
This equation should be valuable in determining which of
two functions is the more influential in causing selection.
Suppose that for a narrow and a wide range we find $b_{21}$ =
approximately $B_{21}$, but that $b_{12}$ does not = $B_{12}$. This suggests
that trait (1) is the causal trait in bringing about the selection 
and trait (2) the consequential trait, or more accurately stated, 
that trait (1) is more closely related to whatever is the cause 
of the selection than is trait (2). Here again the regression 
coefficient is the significant constant for purposes of interpre- 
tation. 

Brown (Brown, Carl—see Yerkes, 1921, pp. 629-632) has 
utilized certain properties of the normal distribution in deter- 
mining the ratios of the standard deviations and therefore in 
determining the correlations in the two ranges. The Division 
of Psychology of the Surgeon General’s Office found that many 
of its intelligence tests showed evidence of a “jam” at one or
the other extreme; that is, the test was too difficult, resulting 
in large numbers of zero scores, or too easy, resulting in large 
numbers of perfect scores. Except for the extreme scores 
most of the tests gave approximately normal distributions. 
Accordingly the extremes of each test distribution were cut off 
and the correlation for the resulting scatter diagram calculated. 
This is an r from a curtailed distribution. If the ratio σ₁/Σ₁
can be determined, formula [186] will give the correlation R
that would maintain throughout the entire distribution if the
undistributed extreme scores could be replaced by scores as
discriminative as those in the middle region of the distribution.
We can obtain σ₁/Σ₁.

Let us be given a normal distribution of standard deviation
Σ₁ and cut off a proportion p₁ at the lower end and a proportion
q₂ at the upper end, leaving a population of (1 − p₁ − q₂),
which is the same as (q₁ − q₂) in the usual notation as given in
Sections 24 and 27, from which the correlation r is obtained.
No curtailment, except consequential, is made in variable 2.
Let us suppose that the standard deviation of the non-truncated
normal distribution Σ₁ is equal to 1.0. Then σ₁ as a
proportion of Σ₁ is the only constant needed in order to use
formula [186]. The standard deviation of that portion of the
distribution, as shown in the accompanying diagram, lying
between the ordinates x₁ and x₂ is required. If the equation
of the total normal distribution is

$$z=\frac{1}{\sqrt{2\pi}}\,e^{-x^{2}/2}$$

the standard deviation of the truncated portion is given by

$$\sigma_1^{2}=\frac{\int_{x_1}^{x_2} z\,x^{2}\,dx}{q_1-q_2}-d^{2}$$

The integral $\int_{x_1}^{x_2} z\,x^{2}\,dx$ integrated by parts and evaluated at the limits gives

$$x_1z_1-x_2z_2+(q_1-q_2)$$

while d, the distance from the mean of the portion to the mean
of the total normal distribution, is given by formula [55] so that

$$\frac{\sigma_1^{2}}{\Sigma_1^{2}}=1+\frac{x_1z_1-x_2z_2}{q_1-q_2}-\left[\frac{z_1-z_2}{q_1-q_2}\right]^{2}$$
(Standard deviation squared of a portion of a normal distribution
of standard deviation, Σ₁, equal to 1.0) . . . . . . . . . [188]


Brown has called the right hand member 1 + J and introduced
J into the equation giving r. We will, however, leave formula
[186] as it is and expect σ₁/Σ₁ to be calculated by the present
formula [188] in case of truncation at one or both ends of a
normal distribution and the resulting value introduced into
formula [186]. Many very neat illustrations of the aid in 
interpretation resulting from the use of this formula are given 
in Yerkes (1921). One word of caution is offered. If multiple 
correlation coefficients are being calculated it is absolutely 
necessary that all the data be consistent. Otherwise such 
absurdities as imaginary correlation coefficients may result. 
Presumably if there are several variables, and every time a
variable enters a correlation table its distribution is curtailed
in one certain manner, not only would the r’s, or the correlations
from these truncated distributions, be consistent with each
other, but also the R’s, or the enlarged correlations found by
correcting for limited ranges. I have not proven this statement,
but the converse is certainly obvious, that if the cut
occurs in several places in the several scatter diagrams involving
a certain variable there is no statistical imposition making
the r’s consistent, so that both the r’s and the R’s may be
inconsistent. On page 633 of Yerkes (1921) occurs a table
showing that army intelligence test Alpha₁ was cut between
scores one and two in one scatter diagram and not cut at all
in the other correlation tables. There is no evidence that for
these particular data any inconsistency has been introduced
by this procedure, but if the correlation had run high,
.990-.999, instead of being less than .98 the lack of a necessary
consistency in the original data would be serious.
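The ratio σ₁/Σ₁ required by formula [186] can be computed from the two points of truncation. The sketch below is an illustration only, written on the assumption that formula [188] reads σ₁²/Σ₁² = 1 + (x₁z₁ − x₂z₂)/(q₁ − q₂) − [(z₁ − z₂)/(q₁ − q₂)]², with x in units of Σ₁, z the ordinate and q the proportion lying above x. The function names are hypothetical, and the areas are taken from the standard library's complementary error function rather than from printed tables.

```python
import math


def _ordinate(x):
    """Ordinate z of the unit normal curve at x (taken as 0.0 at infinity)."""
    if math.isinf(x):
        return 0.0
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def _upper_area(x):
    """Proportion q of the unit normal distribution lying above x."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def truncated_variance(x1, x2):
    """Variance of the part of a unit normal distribution between x1 and x2."""
    if not x1 < x2:
        raise ValueError("x1 must be below x2")
    z1, z2 = _ordinate(x1), _ordinate(x2)
    kept = _upper_area(x1) - _upper_area(x2)      # q1 - q2
    x1z1 = 0.0 if math.isinf(x1) else x1 * z1     # the product vanishes at infinity
    x2z2 = 0.0 if math.isinf(x2) else x2 * z2
    d = (z1 - z2) / kept                          # mean of the retained portion
    return 1.0 + (x1z1 - x2z2) / kept - d * d


def curtailed_sd_ratio(x1, x2):
    """sigma_1 / Sigma_1 for a cut at x1 below and x2 above."""
    return math.sqrt(truncated_variance(x1, x2))
```

Passing `float("-inf")` or `float("inf")` for a limit represents truncation at one end only.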


Section 64. THE EFFECT OF DOUBLE SELECTION UPON
CORRELATION OF DIFFERENT MEASURES


A correction formula is available in case there has been
selection in both variables. For example, consider a correlation
between heights of brothers and sisters when brothers
between heights a and b are used and when sisters between
heights c and d are used, thus dropping out all pairings in which the
brother’s height lies outside of ab, irrespective of sister’s
height, and also all pairings in which the sister’s height lies
outside of cd, irrespective of the brother’s height. Here there
is selection both in the x trait and in the y trait. Let σ₁ and σ₂
be the standard deviations in the unselected distribution and
let the selection in x alone be such as to change σ₁ to s₁, and
let the selection in y alone be such as to change σ₂ to s₂. Let
Σ₁ be the standard deviation of the x’s and Σ₂ the standard
deviation of the y’s in the doubly selected distribution. To
point the relation between s₁ and Σ₁ we may write

Σ₁ = the standard deviation consequent to the direct selection
of the x’s and also due to the indirect effect of
selection of the y’s.

s₁ = the standard deviation consequent to the direct selection
of the x’s.

Thus s₁ is not a standard deviation determined either from the
original or the doubly selected population. It may, however,
be determined by formula [188] or otherwise, if the nature of
the selective agency operating upon the x’s is known. The
symbols s₂ and Σ₂ have similar meanings when dealing with
the y’s. Pearson (1908, inf.), starting with an original, normal
correlation surface has given formulas showing the effect of
double selection upon means, standard deviations, and correlation.
Letting t₁ = s₁/σ₁, t₂ = s₂/σ₂ and letting small letters
represent constants in the unselected distribution and capital
letters in the selected, his formulas may be expressed:


Σ₁ = φ (σ₁, t₁, t₂, r)                    Given by Pearson (1908 inf.) [189]
Σ₂ = φ (σ₂, t₂, t₁, r)                    Given by Pearson (1908 inf.) [189 a]
m₁ = φ (σ₁, σ₂, M₁, M₂, t₁, t₂, r)        Given by Pearson (1908 inf.) [189 b]
m₂ = φ (σ₂, σ₁, M₂, M₁, t₂, t₁, r)        Given by Pearson (1908 inf.) [189 c]

$$R=\frac{r\,t_1t_2}{\sqrt{1-r^{2}(1-t_1^{2})}\;\sqrt{1-r^{2}(1-t_2^{2})}}$$
(Relation between r in a normal correlation
surface and R in the surface obtained
from the preceding by double selection) . . [190]


Theoretically one could solve equations [189] and [189 a] for
t₁ and t₂ in terms of σ₁, σ₂, Σ₁, Σ₂ and r; substitute in formula
[190] and thus relate R with r knowing the unselected and
selected standard deviations. However, a solution of the t’s
in terms of the other constants runs into a bi-quadratic which
apparently does not simplify so that the symbolic solution
is not here attempted. The numerical solution for a given
problem is however possible, so that knowing Σ₁ and Σ₂ the
ratios t₁ and t₂ may be determined from equations [189] and
[189 a] or more simply, if the necessary facts as to curtailment
are known, by formula [188], and substituted in formula [190]
to obtain R.
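Once t₁ and t₂ are in hand the substitution in formula [190] is a single line of arithmetic. The sketch below is illustrative and assumes that [190] reads R = r t₁t₂ / (√(1 − r²(1 − t₁²)) √(1 − r²(1 − t₂²))); the function name `doubly_selected_r` is a hypothetical one, not Pearson's or the author's.

```python
import math


def doubly_selected_r(r, t1, t2):
    """Correlation R after double selection, from the unselected r.

    t1 = s1 / sigma_1 and t2 = s2 / sigma_2 are the ratios by which direct
    selection alone changes each standard deviation.
    """
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    if t1 <= 0 or t2 <= 0:
        raise ValueError("t1 and t2 must be positive")
    denominator = (math.sqrt(1.0 - r * r * (1.0 - t1 * t1))
                   * math.sqrt(1.0 - r * r * (1.0 - t2 * t2)))
    return r * t1 * t2 / denominator
```

With t₂ = 1 the expression reduces to the single-selection case, r t₁/√(1 − r² + r²t₁²), and with both ratios equal to 1 it returns r unchanged.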

Standard deviations may be either increased or decreased by 
selection due to increasing or decreasing certain arrays. Accordingly
there is no necessity that t₁ or t₂ be less than one, nor
that R be less than r. Whereas both the regression lines in 
the correlation surface or scatter diagram giving r are recti- 
linear since normality of surface was assumed, in general 
neither regression in the scatter diagram giving R will be 
rectilinear. As a consequence formula [190] is not symmetrical 
with reference to R and r. Selection could conceivably be of 
such sort that both the selected and unselected surfaces were 
normal, in which case the appropriate formula would of neces- 
sity be symmetrical with respect to R and r. The nature of 
the selection which would lead to this result is worthy of 
investigation. 


CHAPTER X 
FURTHER METHODS OF MEASURING RELATIONSHIP 


Section 65. THE VARIOUS WAYS OF MEASURING RELATIONSHIP


The treatment of the preceding two chapters has shown 
something of the extent and detail of analysis of inter-relation- 
ship between two quantitative variables which are related in a 
rectilinear manner, or at least in such a manner that a simple 
transformation will bring about rectilinear regression. If 
quantitative data are not of this nature, or if the data are 
qualitative, a number of accessory methods of measuring 
relationship are available, none of them, however, permitting 
the detail of interpretation and flexibility of treatment possible 
with rectilinearly related quantitative variables. Three gen- 
eral lines have been followed in developing accessory methods 
of measuring relationship: (1) leading to measures of relation- 
ship which would be identical with the product-moment cor- 
relation coefficient, provided data were (a) recorded in a 
quantitative instead of in a qualitative form and (b) related in
a rectilinear instead of a curvilinear manner; (2) devising other 
measures of relationship; and (3) interpreting relationship in 
terms of probability. 

The only method of the second and third groups which has, 
beyond cavil, demonstrated itself to be generally serviceable is 
the “goodness of fit” method developed by Pearson (1900,
crit.). However, before treating of these methods we may 
concern ourselves with (1) the measures of relationship which 
are equivalent in meaning to the product-moment coefficient 
of correlation. 


Section 66. THE MEDIAN RATIO CORRELATION COEFFICIENT


A method has been proposed by Thorndike (1913), which
has not as yet been studied sufficiently to establish its comparability
with the product-moment coefficient for a variety of
types of scatter diagrams. In the usual notation

$$r_{\text{mdn ratio}}=\text{Median of the }\left(\frac{x/\sigma_1}{y/\sigma_2},\ \frac{y/\sigma_2}{x/\sigma_1}\right)\text{ ratios}$$
(Thorndike’s median ratio coefficient of correlation) . . . . . . [191]


In using this method some convention must be adopted with
reference to x/0, y/0, and 0/0 ratios. In case grouping is
fine, so that there is the possibility of few such ratios, the point
is not important; but if there are large numbers of measures
in the intervals having the means as their class indexes, then
x/0, y/0 and 0/0 combinations will make for uncertainty in
results. Calling 1/2 of these equal to ∞ and the other half
equal to −∞ will throw the burden of determining r upon the
remaining ratios and, at least in the case of a normal correlation
surface, this would not introduce a systematic error. If
the grouping is fine so that the x = 0 and y = 0 frequencies
are lacking or negligible in number, and if the correlation
surface is normal, then the median ratio for any array is equal 
to the product-moment correlation coefficient, and, of course, 
the median of the ratios for the entire table equals the product- 
moment coefficient. We thus see that for this important cor- 
relation surface, and with fine grouping, Thorndike’s median 
ratio coefficient has the same value as the product-moment 
coefficient. Further investigation of this coefficient is needed 
and, pending it, the method should not be used indiscriminately 
as a substitute for the product-moment method. 

The distribution of ratios is very peculiar and the standard 
deviation of such distribution will generally be infinite, so that 
it is futile to calculate the standard error of the median ratio 
coefficient of correlation. The quartile deviation of these 
ratios, however, is not infinite, and we may take as a first 
approximation to the probable error, 


$$\text{P.E. of } r_{\text{mdn ratio}}=\frac{\text{quartile deviation of ratios}}{\sqrt{N}}$$
(Approximate quartile error of the median
ratio coefficient of correlation) . . . . . . . . [192]


Noting that

$$\frac{x'/\sigma_1}{y'/\sigma_2}\cdot\frac{y''/\sigma_2}{x''/\sigma_1}=\frac{x'}{y'}\cdot\frac{y''}{x''}$$

the median of the

$$\left(\frac{x/\sigma_1}{y/\sigma_2},\ \frac{y/\sigma_2}{x/\sigma_1}\right)$$

ratios will be closely equal to

$$\sqrt{(\text{mdn of } x/y \text{ ratios})\,(\text{mdn of } y/x \text{ ratios})}$$

Thus, we will write, as a very much simpler formula to use,

$$r_{\text{mdn ratio}}=\sqrt{(\text{mdn of } x/y \text{ ratios})\,(\text{mdn of } y/x \text{ ratios})}$$
(Thorndike’s median ratio coefficient of correlation) . . [191 a]


There is a certain directness in interpretation which commends
this coefficient, but even in the form [191 a] it will hardly
prove more expeditious to use than the regular product-moment 
method, while its probable error will, for usual surfaces, always be 
larger than the probable error of the product-moment coefficient. 
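Formula [191 a] may be tried on any paired series with a few lines of code. The following is a minimal sketch, not Thorndike's own procedure: it takes x and y as deviations from their means, and it simply drops the ratios having a zero divisor. Dropping them leaves the median where the half-to-∞, half-to-−∞ convention would put it whenever their number is even. Giving the result the sign of the median ratios, and the name `median_ratio_r`, are assumptions made for the illustration.

```python
import math
from statistics import median


def median_ratio_r(xs, ys):
    """Median ratio coefficient in the form [191 a] for paired raw scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dx = [x - mean_x for x in xs]          # deviations from the means
    dy = [y - mean_y for y in ys]
    # Ratios with a zero divisor are omitted (an assumption of this sketch).
    x_over_y = [a / b for a, b in zip(dx, dy) if b != 0]
    y_over_x = [b / a for a, b in zip(dx, dy) if a != 0]
    med_xy = median(x_over_y)
    med_yx = median(y_over_x)
    product = med_xy * med_yx
    if product < 0:
        raise ValueError("the two median ratios disagree in sign")
    # The square root loses the sign, so it is restored from the medians.
    return math.copysign(math.sqrt(product), med_xy)
```

The standard deviations never enter, which is the simplification that [191 a] offers over [191].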

Let us try this method upon the very curvilinear insurance 
data of Chart XXVII. We will use ξ and ζ as though they were
x and y, deviations from the actual means, for comparison with
our other calculations in which they were so used. We have 
the ratios listed below taking the measures by rows beginning 
at the top row. The calculation has been made by a slide rule, 
so that one need not expect an exact check upon every figure. 


TABLE XLII 
iS gc E iG 

ele he g c 5 F 

I 12.9 says} ||| a == eo — 333 

I 2:2 .082 I — 2.7 — .3750 

I 14.0 Lovie || 2 — 2.4 = ili) 

I 9.8 .102 I =) BY — .458 

il 8.3 .120 I 1.6 .616 

2 7:5 133 3 2.74 365 + 

2 6.8 ily || 2 i er ixe) — .058 

rT |—83 — .120 I 1.6 .628 

I oul SLAON |\e2 22 457 — 

B 5.0 PEO. |||, 2 Bui 2H 

I | —5.2 — .193 I Tde7 .086 

I 4.4 .226 I ZO) .330 

I 3.9 258 I 5.10 .196 

I 3.4 .290 I 16.2 .062 

2 2.8 355 2 — 32.3 -031 

I BG 3.000 I 2.61 .383 

2 |~ .25 |—4.000 

Da —e-2 1) —74'.500 a 

Bo |= econ Products ce 

I }—8.0 |— .125 L. Q.—1.215) —.159 SOHO RY SF | SOS: 
Mdn_ 2.675 Wil .297 | 545 + 
WL Os, Soe 2645 e573 1.255 

r (mdn ratio) = .545     Quartile error of r = (quartile deviation of ratios)/√48 = .122




This result, r = .55 ± .12, may be compared with the
product-moment correlation, r = .64 ± .06, and the corrected
correlation ratios, η₁₂ = .73 ± .05 and η₂₁ = .74 ± .05. Thus
for this particular surface in which the regression lines 
do not pass through the intersection of the means, the median 
ratio correlation is less than the product-moment correlation. 
Thorndike (1913) gives an illustration in which the median 
ratio coefficient is 1.00 and the product-moment coefficient less 
than 1.00. No general rule for the relation between these 
two correlations for non-rectilinear and non-homoclitic surfaces 
is offered. 


Section 67. CORRELATION DETERMINED FROM A CURVE OF 
CORRESPONDENCE BY RANK 


This method, which may, more briefly, be described as the 
rank relation method, is proposed by Otis (1916). It prob- 
ably has no essential advantages for rectilinear data, but offers 
promise if regressions are curvilinear. Having a scatter dia- 
gram, a line is to be drawn which will equate scores of the two 
variables. If regressions are rectilinear this line is given by 
the equation x/o1 = y/o2 (see Section 43), but if not rectilinear 
some other device must be followed. Otis writes (1916, p.
720): “In order to get a better idea where to draw the curve
of relation an auxiliary plot may be made . . . on the assumption
that the true correspondence of the scores of the two
tests would be more nearly approximated by that of two scores
having the same rank than by those of the same child.” Otis
does this graphically, smoothing slight irregularities. Having 
this curve of correspondence by rank we may locate a value 
on the x-scale for each value of y (or vice versa) and call the 
obtained value y’; that is, y’ is, in terms of the x-scale, the 
equivalent of y. Thus y’ measures and x measures have the 
same variability and the same mean. Let us designate the
difference (x − y′) by the symbol d_x and designate (y − x′)
by d_y. This enables us to use formula [131] in the calculation
of the correlation. Otis notes that σ_dx/σ_x is approximately
equal to

$$\frac{\text{mdn of the } |d_x\text{'s}|}{\text{mdn deviation of } |x\text{'s}|}$$

so that, in our notation,

$$r=1-\frac{(\text{mdn of } |d_x\text{'s}|)^{2}}{2\,(\text{mdn dev. of } |x\text{'s}|)^{2}}$$
(Otis’ deviation formula for correlation) . . . . . . . . [193]

or, if x-values have been transformed into equivalent y-scores,

$$r=1-\frac{(\text{mdn of } |d_y\text{'s}|)^{2}}{2\,(\text{mdn dev. of } |y\text{'s}|)^{2}}$$
. . . . . . . . . . . . . . . . . . . . . . . . . . . . [193 a]
These two formulas are minor modifications of formula [131], 
but Otis’ manner of determining the d’s is unique. These are 
not (X — Y)’s nor even (x — y)’s, but differences when (a) 
unequal variability has been allowed for, and (b) when one 
variable is transformed into a second by means of a curvilinear 
relation line. Thus the so-called r obtained is in reality more
closely related to a correlation ratio η than to the correlation
coefficient r, but it has an advantage over η, in that not only
is the strength of the relationship measured, but the nature
of it graphically established. The method suffers with all
graphic methods in not enabling a concise algebraic statement
of the relations which hold. We may expect the values obtained
by its use to more nearly approach corrected η [200 b]
than the product-moment r.
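The arithmetic of formula [193] is slight once the equivalent scores are in hand. The sketch below is illustrative only: `y_equiv` holds the y′ values, already expressed on the x-scale by whatever curve of correspondence has been adopted, and the median deviation of the x’s is taken about their mean. Both the name `otis_r` and that reading of the median deviation are assumptions.

```python
from statistics import median


def otis_r(x, y_equiv):
    """Formula [193]: r = 1 - (mdn |d|)^2 / (2 (mdn dev. of x)^2)."""
    mean_x = sum(x) / len(x)
    # d = x - y', the difference between a score and its paired equivalent.
    mdn_d = median(abs(a - b) for a, b in zip(x, y_equiv))
    # Median of the absolute deviations of the x's from their mean.
    mdn_dev = median(abs(a - mean_x) for a in x)
    return 1.0 - mdn_d ** 2 / (2.0 * mdn_dev ** 2)
```

Interchanging the parts played by the two variables gives the second coefficient, which in general will differ from the first.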

The insurance data of Chart XXVII may be used to illustrate
the method, but to make it a little more algebraic than graphic
we will equate measures by the method of Section 35, that is,
we will call equal percentile values equivalent and will not
resort to smoothing.


TABLE XLIII

Columns (a) and (b) give the correspondence of measures by rank.
(a) Insurance in force
(b) Per cent white population
(c) Per cent white population
(d) Rank equivalent of paired insurance in force measure
(e) Insurance in force
(f) Rank equivalent of paired per cent white population measure

Mean     (a)     (b)     (c)     (d)     (e)     (f)
 294     341     99      99      99      341     294
         321     99      99      99      285     294
         304     99      99      99      270     294
         290     99      99      97      219     294
         285     99      99      96      192     294
         272     99      99      95.5    190     294
         272     99      99      90      170     294
         270     99      99      97      224     294


 247     269     98      98      99      321     247
         254     98      98      98      290     247
         253     98      98      99      272     247
         251     98      98      98      269     247
         244     98      98      98      253     247
         241     98      98      98      244     247
         237     98      98      98      241     247
         237     98      98      95      182     247
         234     98      98      93      171     247
 216     227     97      97      99      272     216
         224     97      97      98      234     216
         219     97      97      97      204     216
         207     97      97      96      197     216
         204     97      97      95      182     216
 195     202     96      96      98      237     195
         197     96      96      96      202     195
         192     96      96      95.5    190     195
         190     96      96      94      176     195
(the two 190’s of column (a) take the rank equivalent 95.5)
 185     190     95      95      99      304     185
         182     95      95      98      251     185
         182     95      95      98      237     185
         176     94      94      82      140     176
         171     93      93      56      103     171
         170     90      90      88      167     170
         167     88      88      83      142     167
         158     87      87      57.5    105     158
147 84 84 98 
142 83 83 97 254 147 
207 142 
136 { 140 82 82 97 227 140 
133 Z B 82 54 IOI 133 
) I I 
133 78 78 80 aa Sid 
132 71 71 44 96 132 
726 68 68 67 I2I ic 
: 3 : ef st ot 158 I2I 
§ 5 fe) 133 10 
T05 57-515, 57 57.5 105 Ee 
103 56 56 68 
101 54 54 84 126 103 
96 44 44 71 147 101 
84 43 43 43 132 96 
84 84 




The measures in column (a) are insurance in force scores 
arranged according to magnitude, and the measures in column 
(b) per cent white population scores arranged according to 
magnitude. Column (c) is the same as column (b) and is 
obtained from the first column of Table XXXVII. The first 
entry, 99, in column (d) is the column (b) equivalent of 341 
column (a), which is the measure paired with the first 99 in 
Table XXXVII. As a second illustration; the fifth 99, first 
column, Table XXXVII, is paired with 192. The value 192,
column (a), is equivalent to 96, column (b), which is accordingly 
the value recorded in column (d) opposite the fifth 99 in column 
(c). The mean of column (d) is equal to that of column (c) 
and except for the slight grouping error in replacing 96 and 
95 by 95.5 and 95.5, the replacing of 82 and 78 by 80 and 80, and 
the replacing of 58 and 57 by 57.5 and 57.5 the standard devia- 
tions are equal, so that we may use formula [131] in calculating 
the correlation. This gives r = .70. 

A similar calculation, interchanging the variables, gives
columns (e) and (f) and the final correlation r = .65. Compare
this with r = .64, η₁₂ = .73 and η₂₁ = .74 of Section 52.
These two correlation coefficients, or correlation ratios as they
are more closely related to η than to r, should be differently
labeled. Otis did not point out the fact that there are two
for each table and that in general they will not be equal. The
method is still in the elementary stage and needs (a) relating
with r and with η, (b) an algebraic method (such as here used
in equating percentiles, or still better a method resulting in the 
equation of the line of rank relation) for determining the curve 
of relation by rank, (c) determination of the types of correla- 
tion surfaces to which applicable, (d) utilization of coefficient 
and relation line obtained to estimate one variable knowing 
the second, and (e) determination of the probable errors of 
the constants involved. The most interesting feature of the 
method is that but a single relation line is used. However, 
the physical significance of this line will probably not be found 
to be as definite or serviceable as the regression lines of a cor- 
relation table. 




Section 68. CORRELATION RATIO METHOD


Formula [86] gives the relationship between standard deviations
of arrays and total standard deviation, and the coefficient
of correlation in the case of rectilinear regression. Solving
this for r² we have

$$r^{2}=\frac{\sigma_1^{2}-\sigma_{1.2}^{2}}{\sigma_1^{2}}$$

Formula [87] shows that σ₁² − σ²₁.₂ = σ²_x̄, leading to

$$r=\frac{\sigma_{\bar{x}}}{\sigma_1}$$

and also

$$r=\frac{\sigma_{\bar{y}}}{\sigma_2}$$

That is, if the regression is rectilinear the correlation coeffi- 
cient is the ratio of the standard deviation of the means of 
the x-arrays to the standard deviation of the x’s; or it is the 
ratio of the standard deviation of the means of the y-arrays 
to the standard deviation of the y’s. This form suggests the 
use of these ratios when regressions are not rectilinear. The 
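resulting values are called correlation ratios and are represented
by the symbol η, eta. Note that there are two for
each scatter diagram.

$$\eta_{xy}=\frac{\sigma_{\bar{x}}}{\sigma_x}=\sqrt{1-\frac{\sigma_{ax}^{2}}{\sigma_x^{2}}}$$
(Correlation ratio of x upon y) . . . . . . . . . . [194]

$$\eta_{yx}=\frac{\sigma_{\bar{y}}}{\sigma_y}=\sqrt{1-\frac{\sigma_{ay}^{2}}{\sigma_y^{2}}}$$
(Correlation ratio of y upon x) . . . . . . . . . . [194 a]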
The correlation ratio is of necessity greater than zero and less
than one. The proof of this is left as an exercise. Further,
σ_ax is the standard deviation of the x-arrays around their
means, whereas σ₁.₂ is the standard deviation of the x-arrays
around the best fit straight line. The contribution of each
array to σ₁.₂ will be greater than the contribution to σ_ax in
case the mean of the array is not exactly upon the regression
line. Therefore σ_ax < σ₁.₂ and as a consequence η > |r|, and
η² > r². The difference between η² and r² is ζ and is a measure
of non-rectilinearity of regression. Therefore the test for
linearity is

ζ = η² − r²   (Test for linearity of regression) . . . . . . . . . . [195]




We need the standard error of this magnitude. Blakeman
(1905) gives it as

$$\sigma_\zeta=2\sqrt{\frac{\zeta}{N}}\;\sqrt{(1-\eta^{2})^{2}-(1-r^{2})^{2}+1}$$
(Standard error of the test for linearity) . . [196]

or approximately,

$$\sigma_\zeta=2\sqrt{\frac{\zeta}{N}}$$
. . . . . . . . . . . . . . . . . . . . . . . . . . . . [197]

if η and r are not very different.
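The test of [195] and its two standard errors may be computed together. The sketch below is an illustration under the assumption that [196] is Blakeman's usual expression, 2√(ζ/N) · √((1 − η²)² − (1 − r²)² + 1), and that [197] is the same with the second radical taken as unity; the function name `linearity_test` is hypothetical.

```python
import math


def linearity_test(eta, r, n):
    """Return (zeta, exact standard error [196], approximate standard error [197])."""
    zeta = eta * eta - r * r
    if zeta < 0:
        raise ValueError("eta should not be numerically smaller than |r|")
    base = 2.0 * math.sqrt(zeta / n)                       # formula [197]
    bracket = (1.0 - eta * eta) ** 2 - (1.0 - r * r) ** 2 + 1.0
    return zeta, base * math.sqrt(bracket), base


# Illustrative values only: eta = .7310, r = .64, N = 48.
zeta, se_exact, se_approx = linearity_test(0.7310, 0.64, 48)
print(round(zeta, 4), round(se_exact, 4), round(se_approx, 4))
```

Since η² is never below r², the bracket cannot exceed 1, so the approximate form [197] never understates the standard error.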

The calculation of σ_x̄ offers no difficulties. The mean for
each array is calculated and the standard deviation of these 
found, taking each mean as many times as there are measures 
in the array. If the population is small the data should be 
grouped so that at least two measures are found in each array. 
The scatter diagram on page 241 shows the grouping that 
may be employed for the insurance data of Section 52. The 
class marks to the nearest $1.00 in the insurance in force data, 
and to the nearest 1 per cent in per cent of white population, 
are the means, not the mid-points of intervals, of the measures 
grouped. The origins are, to the nearest $1.00 and 1 per cent, 
the means of the total population. Neglecting the slight error 
due to not keeping fractional parts of the $1.00 or parts of 
1 per cent gives the table and calculation on page 241.
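The calculation just described, in which each array mean is taken as many times as there are measures in the array, may be sketched as follows. This is an illustration only; the function name `correlation_ratio` and the use of arbitrary labels to mark the arrays are assumptions, and no correction for grouping is applied.

```python
import math
from collections import defaultdict


def correlation_ratio(values, array_labels):
    """Raw eta: S.D. of the array means divided by the S.D. of all the measures."""
    n = len(values)
    grand_mean = sum(values) / n
    total_ss = sum((v - grand_mean) ** 2 for v in values)

    # Collect the measures falling in each array.
    arrays = defaultdict(list)
    for value, label in zip(values, array_labels):
        arrays[label].append(value)

    # Each array mean is weighted by the number of measures in the array.
    between_ss = sum(len(a) * (sum(a) / len(a) - grand_mean) ** 2
                     for a in arrays.values())
    return math.sqrt(between_ss / total_ss)
```

Putting every measure in an array of its own returns 1.0, which shows why too fine a grouping yields a value without real significance.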

The coarseness of grouping affects the size of η. With
grouping so fine that but a single measure is found in any
array, η would then = 1.0 and of course would have no real
significance. In order to obtain a reasonable value for η
grouping should be sufficiently coarse to result in a fairly
regular, although not necessarily straight regression line.
Pearson (1911 cor) has pointed out that the significance of
η should be judged not by its difference from zero, but by its
difference from the value that is the most probable in case of
zero correlation between the two variables. Or in other words
he has pointed out that a correction to the raw eta is necessary.
Since the standard deviation of means of arrays is of necessity
positive, this value for finite populations is as a matter of chance
greater than zero, and if the population dealt with is small
and the grouping fine it may be very much greater. The
chances are, not only in the case of the zero relation, but
whatever the relation, that the obtained η is larger than it
would be from an infinite population. Let η be the obtained
correlation ratio and fη the most probable ratio from an infinite
population. And let κ equal the number of arrays; then,
when the frequencies in the arrays do not differ in a very
extreme manner from each other we have, as given by Pearson,
Pd Cte §. 
me Pee 
{a oka (Eta corrected for too 
N fine a grouping). . .[198] 


Coarse grouping was resorted to in the calculation of η just
given for the purpose of eliminating as much as possible of 
the error coming from too fine a grouping. But even so the 
correction is not negligible, since 


(7955)? — <5 
fis = ———3 = 5344, OF ymi2 = «7310 
48 
(8019) —3, 
frnu= ———; or pna1 = -7394 
48 


The correlation ratio does not enable an estimation of one 
variable, knowing a second, as does the regression equation. 
Its value lies in giving a sort of upper limit to correlation. 
The use of some curvilinear regression line or transformation 
line, as in the case of the insurance and per cent white popula- 
tion data of Section 52, may lead to an actual means of esti- 
mating one variable knowing a second. The correlation ratio 
is also valuable as used with the data just mentioned in leading 
to ζ, and to the standard error of ζ, thus determining the likelihood
of violation of data by the assumption of a rectilinear or
other definite regression line. The standard error of η is usually
taken as

$$\sigma_\eta=\frac{1-\eta^{2}}{\sqrt{N}}$$
(Standard error of the correlation ratio) . . . . [199]

but if η is large, due to too fine a grouping and small population,
the standard error as given by this formula is too small and a
corrective factor is necessary.




TABLE XLIV 
we 
PER CENT WHITE POPULATION f & ve f (5)? 
44 | 56 | 69 | 82 | or | 95 | 96 | 97 | 08 | 90 
341 = ih | I I 142 \2t 220.5 
321 ui at eames | I I 122 2 : 
17 
207 I I 2 08 = | 1445 
eel | eure (ic 5°) os 
& | 274 1) me 5 75 ae | SRO: 
a es | ieee = t 31 
X ¥& | 249 i I 3 5 50 oe 192.2 
q Ei eee 
is) 
5 230 I I I I 2 6 31 sae 2606.67 
a 6 
=o | aaa ae al =e 
197 I 2 2 2 Uf 8 ime 357-14 
oan : Papal —5I 
175 r t I 2 I 6 —24 == 433-5 
I47 I I 2 4 —52 3 506.25 
a a ee a = = 
129 | 2 I I 5 70 = 3537.8 
102 i I ne 2 5 —9O7 | 
Files Sear 6 | 1504-17 
| “cae _48 | 7682.93 
[Column totals and array means for the per cent white population classes: not legible in the source.]
σ_ȳ = 12.6515
σ_x̄ = 51.2760


The usual calculation gives σ₁ = 64.4611 and σ₂ = 15.7778.
As calculated in the accompanying table σ_x̄ = 51.2760 and
σ_ȳ = 12.6515, leading to

$$\eta_{12}=\frac{51.2760}{64.4611}=.7955\qquad\text{and}\qquad\eta_{21}=\frac{12.6515}{15.7778}=.8019.$$




The correction in η for too fine a grouping grows smaller as
the number of categories decreases and this is as it should be,
but an improved result is not obtained by a very coarse grouping,
as then a correction for too coarse a grouping becomes
important. This is based on formula [102] and is the same
sort of a correction as given in formula [103] for a correlation
coefficient, calculated from the means of the classes. Letting
cη_xy be the value of η corrected for use of class means it may
be readily shown, as has been done by Student (1913), that,

$${}_{c}\eta_{xy}=\frac{\eta_{xy}}{r_{yY}}$$
(Correlation ratio corrected for coarse grouping) . . . . . [200]

and

$${}_{c}\eta_{yx}=\frac{\eta_{yx}}{r_{xX}}$$
. . . . . . . . . . . . . . . . . . . . . . . . . . . . [200 a]


To apply the correction we need to know r_xX and r_yY. The
correlation between the class means and the deviates is
r_xX = σ_x/σ_X, and for the second variable r_yY = σ_y/σ_Y. The standard
deviations σ_x and σ_y have already been determined in the
calculation of η_xy and η_yx respectively. Were a normal distribution
assumed σ_x/σ_X could be determined as in the last
chapter, but, though practically it might lead to good results,
it is theoretically unsound for most distributions from which
η is calculated. For the ungrouped data here given σ_X may be
determined from the raw data. Calculation without grouping
from Table XXXVII gives σ_X = 64.6746 and σ_Y = 15.8646.
Accordingly

$$r_{xX}=\frac{64.4611}{64.6746}=.99670\qquad\text{and}\qquad r_{yY}=\frac{15.7778}{15.8646}=.99453$$

Thus for the corrected correlation ratio we have

$${}_{c}\eta_{xy}=\frac{\eta_{xy}}{r_{yY}}=\frac{.7310}{.99453}=.7350$$

$${}_{c}\eta_{yx}=\frac{\eta_{yx}}{r_{xX}}=\frac{.7394}{.99670}=.7418$$


The values calculated as σ_X and σ_Y have not been entirely
freed from a grouping error, particularly σ_Y, since percentages
recorded in the fundamental table are to the nearest 1 per cent
only. To correct further it would be necessary to make some
assumption as to the form of distribution. Plainly the assumption
of a normal distribution for the percentages of white
population will not be sound. On the assumption that the
distribution may be represented by a series of trapeziums of
equal base, Student (1913) shows that the corrective factor is
√(1 + h²/(12σ²)) in which h is the unit of grouping and σ the
standard deviation of x in the case of η_yx and y in the case of
η_xy. Applying this further correction to η_xy we have

$${}_{cc}\eta_{xy}=.7350\sqrt{1+\frac{1}{12\times 15.8646^{2}}}=.7351$$

This correction is merely a re-application of the r_yY division
and is warranted due to the fact that division by .99453, the
r_yY obtained, allowed only for the grouping of several percentages
and not for the error introduced by entering values
in the original table to the nearest per cent only. For the
data in hand the only correction really worth while was the
first, formula [198], that for too fine grouping. The second,
that for too coarse grouping, will amount to 1 per cent if h = σ/2,
or in the case of a normal distribution if there are some 10 or
12 steps, or intervals. This result is obtained by solving the
equation

$$\sqrt{1+\frac{h^{2}}{12\sigma^{2}}}=1.01$$
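The corrections for coarse grouping may be followed numerically. The sketch below is illustrative and uses only figures quoted in the text; it assumes that the correction of [200] is a division of η by the ratio of the grouped to the ungrouped standard deviation, and that Student's trapezium factor is √(1 + h²/(12σ²)). The function names are hypothetical.

```python
import math


def class_mean_correlation(sd_grouped, sd_raw):
    """r between class means and the measures behind them, taken as sd_grouped / sd_raw."""
    return sd_grouped / sd_raw


def correct_for_coarse_grouping(eta, r_class):
    """Formula [200]: divide eta by the class-mean correlation of the grouping variable."""
    return eta / r_class


def trapezium_factor(h, sd):
    """Student's corrective factor for a grouping unit h and standard deviation sd."""
    return math.sqrt(1.0 + h * h / (12.0 * sd * sd))


# Per cent white population: grouped S.D. 15.7778, ungrouped S.D. 15.8646.
r_yY = class_mean_correlation(15.7778, 15.8646)
eta_c = correct_for_coarse_grouping(0.7310, r_yY)
# Further allowance for entries recorded to the nearest 1 per cent (h = 1).
eta_cc = eta_c * trapezium_factor(1.0, 15.8646)
print(round(r_yY, 5), round(eta_c, 4), round(eta_cc, 4))
```

Evaluating `trapezium_factor(0.5, 1.0)` shows the correction of about 1 per cent that arises when h = σ/2.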


A correction for grouping by means of Sheppard’s formula 
[68 a] applied to the standard deviation in the divisor of the 
formula giving the raw η, is appropriate, but no such correction
for the standard deviation in the dividend is to be made for
this is a standard deviation of means, or points, and should
not be corrected by Sheppard’s formula which applies to continuous
variates.

As there are so many corrections which apply to η the following
summary is given,

Let σ_x̄ = the standard deviation of the means of the x-arrays.

Let σ_x = the standard deviation of the x’s.

Then letting η_xy equal the raw correlation ratio of the x’s
upon the y’s we have


244 STATISTICAL METHOD 


Letting snxy equal the value after applying Sheppard’s 
correction for grouping of the x’s we have, if h equals the 
number of units per group, 


ep 


oF h? h2 
sey == is se thee (: +55) .. [194 5] 
NE ~ 12 ox 
Letting $\kappa$ stand for the number of y categories, N the total
number of cases and ${}_{fs}\eta_{xy}$ the preceding value of $\eta$
corrected for too fine grouping of the y's, we have,
FO Mage 
fsV'xy =" G = aliNe ee 
Letting $r_{yy}$ equal $\sigma_{\bar y}/\sigma_y$, i.e., the correlation
between the class means of the y's and the y variates back of the
grouped data (note that $\sigma_{\bar y}$ is the standard deviation of
the class means, but that $\sigma_x$ above [194 b] is the standard
deviation of class indexes), and letting ${}_{cfs}\eta_{xy}$ equal the
preceding $\eta$ corrected for too coarse a grouping of the y's we have

\[ {}_{cfs}\eta_{xy} = \frac{{}_{fs}\eta_{xy}}{r_{yy}} \tag{200 b} \]

In the case of equal intervals in y which are not too large
(say not > 2), $\sigma_y^2 = \sigma_{\bar y}^2\left(1 + h'^2/(12\sigma_{\bar y}^2)\right)$,
in which $\sigma_{\bar y}$ is as before the standard deviation of means
of y classes and h' the number of units per group of y's, so that
$1/r_{yy}$ then equals, very nearly, $\left(1 + h'^2/(24\sigma_{\bar y}^2)\right)$
and we have

\[ {}_{cfs}\eta_{xy} = {}_{fs}\eta_{xy}\left(1 + \frac{h'^2}{24\sigma_{\bar y}^2}\right) \tag{200 c} \]


In [200 c] we may substitute the standard deviation of class
indexes for $\sigma_{\bar y}$, the standard deviation of class means,
without appreciable error, but we cannot make this substitution in
the general formula, $r_{yy} = \sigma_{\bar y}/\sigma_y$ [102], which is
the formula which must be used in case the grouping of y's is in
very broad and unequal intervals, and especially if the classes are
categories not related in a numerical manner.

These corrections to $\eta_{xy}$ are not equally demanded in the
case of any given data. Correction [198] is likely to be the
most necessary. The finer the y grouping, that is, the larger
the number of y-categories and the smaller the total population,
the more important is this correction. Correction [194 b] is
important if the x-grouping is coarse and correction [200] if the
y-grouping is coarse. All of these observations apply to $\eta_{xy}$
and of course similar statements will hold with reference to
$\eta_{yx}$, if in the statements y and x are interchanged throughout.

The student should note that the value of $\eta$ used in the
calculation of $\zeta$, the test for linearity, and in the calculation
of the standard error of $\zeta$, is the raw value and not the
corrected value. Although the corrected value of $\eta$ should not
be used in these formulas [195], [196], [197] as it was not
involved in the derivation of $\zeta$, nevertheless the formula for
$\zeta$ calculated from raw $\eta$ may be expected to give a value which
is materially too large, and a value for its standard error which
is relatively too small, if grouping is fine and population small.
Accordingly the $\zeta$ test for linearity is too rigorous if grouping
is fine and population small.


Section 69. METHOD OF PARABOLIC REGRESSION


Many scatter diagrams are characterized by regular curvilinear
regression lines. If a single positive or negative curvature is
present the regression line may sometimes be closely
represented by a parabola, $y = a + bx + cx^2$; and if the
regression line shows a single inflection the cubic parabola,

\[ y = a + bx + cx^2 + dx^3 \]

may give a good fit. Pearson (1905) has developed the theory
of parabolic regression and illustrated the procedure with
certain data. It is too involved to give here, but must needs
be resorted to if the specific nature of the curvilinear
regression line and the numerical values of the constants involved
constitute the crux of the problem.


Section 70. BI-SERIAL r METHOD


In case one series consists of variates, or graduated measures,
and the other is dichotomous we may determine the correlation 
that maintains if we assume that the trait represented by the 
dichotomic distribution is in reality a continuous trait, normal 
in distribution, for which we have only categorical information. 
Such a situation is well represented by the following, taken 


246 STATISTICAL METHOD 


from the army psychological test data (Yerkes, 1921, p. 748). 
We may proceed with the steps involved in obtaining the
numerical value of bi-serial r and consider the general formula
afterward.

TABLE XLV

SCORE ON               NUMBER OF MEN WHO LEFT SCHOOL
INTELLIGENCE TEST    Below the 9th Grade   Above the 8th Grade
205-212                                            1
200-204                                            3
195-199                                           14
190-194                                           17
185-189                        1                  49
180-184                        2                  54
175-179                        8                  78
170-174                       12                 126
165-169                       18                 149
160-164                       15                 200
155-159                       20                 244
150-154                       45                 305
145-149                       58                 352
140-144                       74                 338
135-139                      101          [illegible]
130-134                      145                 507
125-129                      190                 528
120-124                      216                 530
115-119              [illegible]                 643
110-114                      393                 674
105-109                      507                 682
100-104                      582                 691
 95- 99                      761          [illegible]
 90- 94                      908                 725
 85- 89                      993                 769
 80- 84                    1,181                 693
 75- 79                    1,371                 642
 70- 74                    1,604                 648
 65- 69                    1,709                 567
 60- 64                    1,962                 581
 55- 59                    2,249                 430
 50- 54                    2,272                 346
 45- 49                 2,42[?]                  395
 40- 44                    2,455                 229
 35- 39                    2,473                 200
 30- 34                    2,490                 154
 25- 29                    2,213                 106
 20- 24                    1,835                  60
 15- 19                    1,511                  42
 10- 14                      545                  13
  5-  9                      432                   5
  0-  4                      183                   3
Total                     34,280              13,822


$M_1$ and $M_2$ are the means of the first and second categories
respectively, and $\sigma$ is the standard deviation of the total
distribution (48,102) of measures. Calculation by methods
already given yields

\[ M_1 = 54.987, \qquad M_2 = 98.758, \qquad \sigma = 36.606 \]

and finally

\[ r = \frac{M_2 - M_1}{\sigma}\,\frac{pq}{z} = .7435 \]


With this concrete calculation in mind we may turn to the 
more general statement of the problem. The army Alpha 
series is a variate series, and the graduation or non-graduation 
from the elementary school a categorical series, not correspond- 
ing to a true dichotomy in talent of any sort whatever. Even 
in terms of schooling the two classes are not homogeneous 
within themselves. In the non-graduation class are individuals
who have been in school variously 0, 1, 2, ... 8 years,
while the completion of the elementary school class comprises
those who have been in school 9, 10, ... years. Thus the
dichotomy has been arbitrarily imposed upon a continuous
trait. Let X equal the scores in the variate trait and Y those
in the dichotomous trait, then $r = b_{12}\,\sigma_2/\sigma_1$. The
regression line with slope $b_{12}$ passes through the means of the
x-arrays, $\bar x_1$, $\bar x_2$, of the distribution of cases in the
two y-categories.

[Diagram: the two y-categories (1st category shown) plotted against
value of variate.]

Therefore, referring to the diagram,

\[ r = \frac{\bar x_2 + \bar x_1}{y_2 + y_1}\,\frac{\sigma_2}{\sigma_1}
     = \frac{\bar x_2 + \bar x_1}{\sigma_1}\,\frac{\sigma_2}{y_2 + y_1} \]

Now $(\bar x_2 + \bar x_1)$ is simply $(M_2 - M_1)$, the difference
between the means of the x-scores in the two categories, and
$\sigma_1$, or simply $\sigma$, is the standard deviation of the total
distribution of x-scores. It therefore only remains to obtain

\[ \frac{\sigma_2}{y_2 + y_1} \tag{a} \]
Let p be the proportion of cases in the first y-category and q the
proportion in the second. The distance y is simply the mean
deviate of the tail of a normal distribution and is given by
formula [83]. If z is the ordinate, as given in Table K-W,
at the point of truncation of the normal distribution, cutting
off p proportion of cases, we have

\[ \frac{y_1}{\sigma_2} = \frac{z}{p} \quad \text{and} \quad \frac{y_2}{\sigma_2} = \frac{z}{q}, \quad \text{so that} \quad \frac{\sigma_2}{y_2 + y_1} = \frac{pq}{z} \]

and r may be written

\[ r = \frac{(M_2 - M_1)}{\sigma}\,\frac{pq}{z} \qquad \text{(Bi-serial coefficient of correlation)} \tag{201} \]


This formula differs somewhat from, and is more simple to
use than, Pearson's (1909), but is identical in the principle
underlying its derivation. The coefficient as derived has been
called "bi-serial r," and must be distinguished from "bi-serial $\eta$,"
described in the next section.
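
A minimal sketch of the bi-serial r computation follows. It assumes the standard library normal distribution in place of Table K-W; the function name and the numerical inputs are hypothetical and chosen only for illustration.

```python
from statistics import NormalDist

def biserial_r(m1: float, m2: float, sigma: float, p: float) -> float:
    """Bi-serial r = (M2 - M1)/sigma * pq/z.

    p is the proportion of cases in the first category, q = 1 - p, and z is
    the normal ordinate at the point of dichotomy cutting off proportion p.
    """
    q = 1.0 - p
    nd = NormalDist()
    z = nd.pdf(nd.inv_cdf(p))  # ordinate at the point of truncation
    return (m2 - m1) / sigma * p * q / z

# Hypothetical illustration: a median split with category means one
# standard deviation apart.
print(round(biserial_r(45.0, 55.0, 10.0, 0.5), 4))
```

Because pq/z depends only on the point of dichotomy, the same difference between means gives a different r as the dichotomy moves away from the median.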

In case the grouping of x's is coarse, Sheppard's correction
should be applied in determining $\sigma$. In case the population is
small there is a chance correlation greater than or less than
zero dependent upon the point of dichotomy, so that a
correction of the value just given is necessary. Soper (1914
bi-ser.) gives the following correction formula in which ${}_c r$ is
the corrected value, r the value given by formula [201], x the
deviate given in Table K-W corresponding to area q, the
proportion q being the smaller of the two proportions p and q.


per fis Siete (:-€) (048) 42] 


(Bi-serial Roiesten for small population)........ [202] 


Note that for moderate dichotomies and populations greater
than 100 this correction may generally be considered negligible.
The square of the standard error of bi-serial r as given
by Soper is


i 


(Square of standard error of bi-serial 7)...........[203] 


For dichotomies wherein q is not less than .05 a close
approximation to the preceding formula is

\[ \sigma_r = \frac{\dfrac{\sqrt{pq}}{z} - r^2}{\sqrt{N}} \qquad \text{(Standard error of bi-serial } r\text{)} \tag{204} \]


Even for extreme dichotomies this last formula which gives a
slightly larger value for $\sigma_r$ than formula [203] may well be
preferred, for the assumption of normality of distribution
underlying formula [203] is generally less safe in the case of 
extreme than of moderate dichotomies, so that an increase in 
the size of the standard error due to the extra hazard of the 
assumption of normality is desired and this is given by formula 
[204]. Certain of the functions involved in formulas [202] 
and [203] have been tabled by Soper in the reference cited. 
The evaluation of these formulas is also readily accomplished by 
the aid of Table K-W. 
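
The approximate standard error, formula [204], taken here as $(\sqrt{pq}/z - r^2)/\sqrt{N}$, is easy to evaluate once z is known. The sketch below is illustrative only: the function name and the inputs are assumptions, and the standard library normal distribution stands in for Table K-W.

```python
import math
from statistics import NormalDist

def biserial_se(r: float, p: float, n: int) -> float:
    """Approximate standard error of bi-serial r: (sqrt(pq)/z - r^2) / sqrt(N)."""
    q = 1.0 - p
    nd = NormalDist()
    z = nd.pdf(nd.inv_cdf(p))  # normal ordinate at the point of dichotomy
    return (math.sqrt(p * q) / z - r * r) / math.sqrt(n)

# Hypothetical values: r = .50 from 400 cases split at the median.
print(round(biserial_se(0.50, 0.5, 400), 4))
```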


Section 71. BI-SERIAL ETA


The title of the original contribution by Pearson (1910,
new) describes the data to which this method applies: "On
a new method of determining correlation where one variable is
given by alternative and the other by multiple categories."
To quote further from Pearson (1917 bi-ser.): "Let x be the
alternative, y the multiple variate, $\bar x_y$ the distance from the
division between the alternative categories of the mean of the
array of x's corresponding to a given value of y, ${}_y\sigma_x$ its
standard deviation and $n_y$ its frequency. Let $\bar x$, $\sigma_x$ and
N be the corresponding quantities for the marginal totals." To
utilize the notation of Table K-W, let

\[ x = \frac{\bar x}{\sigma_x}, \qquad x_y = \frac{\bar x_y}{{}_y\sigma_x} \qquad \text{and} \qquad K^2 = \frac{1}{N}\,S(n_y x_y^2) \]

In the notation of Table K-W, $x_y$ is the deviate corresponding
to $q_y$, the proportion of cases lying above the point of dichotomy
of the y-category, and x without subscript is simply the deviate
corresponding to q, the proportion of cases constituting the
smaller of the two x-categories. The number of cases in a
y-category is $n_y$ and S is a summation covering all the
categories in the multiple category variate. Thus

\[ \eta^2 = \frac{K^2 - x^2}{1 + K^2} \qquad \text{(Bi-serial eta)} \tag{205} \]


There is no correction to be made to this formula on account
of the x-variate, but correction formula [198 a] should be used
if $\kappa$, the number of y-categories, is large and the population, N,
small; and correction [200 b] or [200 c] should be made if the
number of y-categories is small. If $\eta$ is small, so that higher
powers are relatively unimportant with reference to $\eta$ and $\eta^2$,
the standard error of $\eta$ is given by

I—7? (4 2 px? \} (The standard error of a bi-serial 
NNN See tee) n which is equal to 0)........ [206] 
The magnitudes p, q, z, x are constants of the alternative
category distribution having the usual meanings and are
available from Table K-W when q is known. If $\eta$ is greater than .5
the full formula for its standard error, as given and fully
described by Pearson (1917 bi-ser.), is needed.

We may use data comparing southern and northern negroes 
collected by the Division of Psychology of the Surgeon Gen- 
eral’s Office to illustrate the method. In general the army 
Alpha test was given to literate individuals of greater than 
feeble-minded intelligence, and army Beta or an individual 
test was given to illiterate individuals or to literate persons of 
very limited intelligence. Accordingly a division of individuals 
upon the basis of whether they were tested by means of army 
Alpha alone; or by means of army Alpha and Beta, or army 
Beta, or army individual, will constitute a dichotomy closely 
related to literacy. Table 4, pages 559-60 of Yerkes (1921), 
enables us to determine whether there is a correlation between
negro literacy and domicile as represented by State of the
union. The table together with the supplementary columns
used in the calculation of bi-serial $\eta$ follows:


TABLE XLVI
Negro Draft - Pro-rated by States

                  EXAMINATION TAKEN
                 Alpha    Alpha-Beta,
STATE            Only     Beta, or       n_y     q_y =       x_y      n_y x_y²
                 (n_α)    Individual            n_α/(n_α+n_β)
                          (n_β)

Alabama            271      1,088      1,359    .1994      .8452      970.82
Arizona              3          4          7    .4286      .1789         .22
Arkansas           192        706        898    .2126      .7926      564.14
California          31         28         59    .5254     -.0627         .23
Colorado            18         12         30    .6000     -.2533        1.92
Connecticut         17         28         45    .3778      .3107        4.34
Delaware            40         44         84    .4762      .0602         .30
Dist. of Col.       30        180        210    .1429     1.0669      239.04
Florida            499        122        621    .8035+    -.8560      455.03
Georgia            416      1,969      2,385    .1744      .9385    2,100.67
Idaho                4          8         12    .3333      .4316        2.24
Illinois           137        114        251    .5458     -.1156        3.35
Indiana             74         51        125    .5920     -.2327        6.77
Iowa                23         13         36    .6389     -.3558        4.56
Kansas              87         30        117    .7436     -.6557       50.30
Kentucky           191        341        532    .3652      .3451       63.36
Louisiana          538      1,147      1,685    .3193      .4705      373.01
Maine                0          0          0
Maryland           146        379        525    .2781      .5888      182.01
Massachusetts       54         39         93    .5806     -.2045        3.89
Michigan            17         25         42    .4048      .2404        2.43
Minnesota            9         11         20    .4500      .1257         .32
Mississippi        773        967      1,740    .4443      .1408       34.49
Missouri           196        182        378    .5185+    -.0476         .86
Montana              2          2          4    .5000      .0000         .00
Nebraska            13         13         26    .5000      .0000         .00
Nevada               0          3          3*
New Hampshire        0          1          1*
New Jersey         105         72        177    .5932     -.2353        9.80
New Mexico           3          1          4    .7500     -.6745        1.82
New York           197        107        304    .6480     -.3799       43.87
North Carolina     211      1,168      1,379    .1530     1.0237    1,445.14
North Dakota         2          1          3    .6667     -.4316         .56
Ohio               163         88        251    .6494     -.3826       36.74
Oklahoma            98        211        309    .3172      .4761       70.04
Oregon               3          3          6    .5000      .0000         .00
Pennsylvania       183        236        419    .4368      .1586       10.54
Rhode Island         9          9         18    .5000      .0000         .00
South Carolina     334      1,303      1,637    .2040      .8274    1,120.68
South Dakota         1         15         16    .0625     1.5382       37.86
Tennessee          504        433        937    .5379     -.0954        8.53
Texas              786      1,048      1,834    .4286      .1789       58.70
Utah                 4          5          9    .4545+     .1130         .11
Vermont              0          0          0
Virginia            56      1,148      1,204    .0465+    1.6747    3,376.76
Washington           7          9         16    .4375      .1560         .39
West Virginia       67        101        168    .3988      .2559       11.00
Wisconsin            2          5          7    .2857      .5651        2.24
Wyoming              4          2          6    .6667     -.4316        1.12

Total            6,520     13,468     19,988 = N                   11,300.20

* Omitted in totals.

q = .32620, x = .450431, z = .360457
K² = .565349
x² = .202888
η² = .362461 / 1.565349
η = .481199
σ_η = .006184


The bi-serial correlation ratio is less than .50 so that we may
obtain a satisfactory idea of its probable error by using formula
[206]. This gives a standard error of .00618 which is so small
with reference to $\eta$ as to establish the fact that there is a
moderate correlation of about .48 between literacy of the negro and
domicile. The obtained value should theoretically be corrected
by applying formulas [198 a] and [200 b] or [200 c]. They
are entirely inconsequential in this problem, but will be used
to show the method. The number of categories in the y-variate
is 45 (number of states yielding frequencies) so that we have,
applying correction [198 a],

\[ {}_f\eta_{xy}^2 = \frac{(.481199)^2 - 44/19988}{1 - 43/19988} \]

from which

\[ {}_f\eta_{xy} = .479423 \]


This correction (.4812 - .4794 = .0018) is not large, but even
so it is probably somewhat too great as the 45 y-categories have
such extremely varying frequencies that the hypotheses
underlying the correction are not closely met. The states constitute
a geographical series and no assumption with reference to
numerical relationship between them seems warranted, nor
any assumption as to total distribution on a one dimensional
scale. However, some correction for coarseness of grouping
is appropriate. We will assume a rectangular distribution of
states of equal populations and will not attempt to justify the
assumption further than to say that the correction that it
leads to is probably conservative, i.e., too small rather than
too large, so that our procedure is an improvement over one
not involving a correction. The standard deviation of a
rectangular distribution of 45 ranks equals

\[ \sqrt{(n^2 - 1)/12} = \sqrt{2024/12} \]

so that, since the unit of grouping is the state, correction
[200 c] is as follows, making h' = 1.0:

\[ {}_{cf}\eta_{xy} = .479423\left(1 + \frac{1}{4048}\right) = .479541 \]


The reader will understand that the number of figures to which 
the work has here been carried and the corrections made are 
for illustrative purposes only and that to meet practical 
demands the raw result, $\eta_{xy}$ = .481, would be adequate for
these particular data. We may now turn to a consideration 
of the correlation between two series, the measures of each 
of which lie in alternative categories. 
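
The arithmetic of this example can be retraced in a few lines. The sketch below starts from the totals of Table XLVI and takes formula [205] as $\eta^2 = (K^2 - x^2)/(1 + K^2)$; the fine-grouping step simply uses the figures 44 and 43 as they stand in the worked example, so it is an illustration of this one case rather than a general statement of [198 a]. Variable names are illustrative.

```python
import math

N = 19988                # total cases, Table XLVI
sum_n_x2 = 11300.20      # S(n_y * x_y^2), Table XLVI
x = 0.450431             # deviate corresponding to q = .32620
kappa = 45               # number of states yielding frequencies

# Raw bi-serial eta: eta^2 = (K^2 - x^2) / (1 + K^2)
K2 = sum_n_x2 / N
eta2 = (K2 - x * x) / (1.0 + K2)
eta = math.sqrt(eta2)

# Correction for too fine grouping, with the numbers of the worked example.
eta_f = math.sqrt((eta2 - 44 / N) / (1 - 43 / N))

# Correction for too coarse grouping [200 c], h' = 1, with the variance of a
# rectangular distribution of kappa ranks, (kappa^2 - 1)/12.
sigma2 = (kappa ** 2 - 1) / 12
eta_cf = eta_f * (1 + 1.0 / (24 * sigma2))

print(round(K2, 6), round(eta, 6), round(eta_f, 6), round(eta_cf, 6))
```

The three values of eta differ only in the third decimal place, which is the point made in the text: for these data the raw result is adequate.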


Section 72. TETRACHORIC CORRELATION


In case we have a 2 × 2 fold table such, for example, as is
given by indicating the presence or absence of two traits we
may calculate $r_t$, the tetrachoric coefficient of correlation.
The assumption underlying the method is that both traits are
really continuous and normal in distribution and that the
dichotomies have forced the data for each trait into two
alternative categories. The procedure was developed by
Pearson (1900 cor.), and tables of "Tetrachoric Functions"
have been calculated by Everitt (1910; also given in Pearson's
Tables 1914 t). Pearson started with the 2 × 2 fold table,


TABLE XLVII

   a        b        a+b
   c        d        c+d
  a+c      b+d        N


so arranged, as is obviously always possible, that a + b > c + d
and a + c > b + d. We will start with a table of the same
sort dealing with proportions instead of gross numbers. Let

\[ \alpha = \frac{a}{N}, \quad \beta = \frac{b}{N}, \quad \gamma = \frac{c}{N}, \quad \delta = \frac{d}{N}, \quad
   p = \frac{a+b}{N}, \quad q = \frac{c+d}{N}, \quad p' = \frac{a+c}{N}, \quad q' = \frac{b+d}{N} \]

Then our table becomes


TABLE XLVIII

   α        β        p
   γ        δ        q
   p'       q'      1.0


Let x and z be the usual quantities obtained from Table K-W,
knowing q, and let x' and z' be the values obtained knowing q'.
Then, letting r be an abridged notation for $r_t$, the tetrachoric
coefficient of correlation, or the correlation as found from a
four-fold table assuming a normal correlation surface, is given
by

\[
\begin{aligned}
\frac{\delta - qq'}{zz'} = {} & r + xx'\frac{r^2}{2} + (x^2 - 1)(x'^2 - 1)\frac{r^3}{6} + (x^3 - 3x)(x'^3 - 3x')\frac{r^4}{24} \\
& + (x^4 - 6x^2 + 3)(x'^4 - 6x'^2 + 3)\frac{r^5}{120} \\
& + (x^5 - 10x^3 + 15x)(x'^5 - 10x'^3 + 15x')\frac{r^6}{720} \\
& + (x^6 - 15x^4 + 45x^2 - 15)(x'^6 - 15x'^4 + 45x'^2 - 15)\frac{r^7}{5040} + \cdots
\end{aligned}
\tag{207}
\]

(Equation giving $r_t$, the tetrachoric coefficient of correlation)


To express the law governing successive coefficients of powers
of r let $v_{n-1}w_{n-1}/n!$ be the coefficient of $r^n$, $v_n$ be a
function of x, and $w_n$ a function of x'; then $v_n$ may be expressed
in terms of v's of a lower order:

\[ v_n = x\,v_{n-1} - (n-1)\,v_{n-2} \quad \text{and similarly} \quad w_n = x'\,w_{n-1} - (n-1)\,w_{n-2} \]
\[ v_0 = 1, \; v_1 = x \quad \text{and similarly} \quad w_0 = 1, \; w_1 = x' \tag{208} \]

Thus the equation as written to the $r^7$ term may be continued
to any number of additional terms desired should it not
converge rapidly enough to make terms above the $r^7$ term negligible.
For small values of r some slight simplification of the work will
result from using Everitt's tables (1910). For values of r
equal to or greater in absolute value than .80, tables (Everitt,
1912 and Lee, 1917) giving the $\delta$ for certain assigned r's and
for various dichotomies are of great assistance, as they enable
a determination of r by interpolation without the extensive
labor involved in formula [207], or in Everitt's form of the
same formula which utilizes his tables. The solution of
equation [207] for r may follow the usual methods employed in the
solution of a parabolic equation of higher degree than the
second, but the method pursued in the following example is
more expeditious for usual values of r. The data are extracted
from the findings of the Division of Psychology of the Surgeon
General's Office (Yerkes 1921, page 507).


TABLE XLIX

                                       SCORE ON ARMY ALPHA
                                        INTELLIGENCE TEST
FIRST LIEUTENANTS                      A or B      Below B

Departments other than Medical          2940         431        3371
Medical Department                      1799         590        2389

                                        4739        1021        5760


Same, expressed as proportions

   .5104 = α        .0748 = β        .5852 = p
   .3123 = γ        .1025 = δ        .4148 = q
   .8227 = p'       .1773 = q'      1.0000


Entering Table K-W with q we find

x = .2152
z = .389809

Entering with q' we find

x' = .925705
z' = .259914

Substituting these values in equation [207] we have

\[ \frac{.1025 - .0735440}{.1013168} = r + .099613\,r^2 + .022741\,r^3 + .05255\,r^4 - .03195\,r^5 + .0288\,r^6 + \cdots \]


Solving the quadratic given by neglecting the last four terms
gives r = .2781. It is obvious by inspection of the signs of
the terms neglected that this value is slightly too large. Let
us therefore assume the value .2770, substitute it for r in the
last five terms of the equation and solve for r to the first power
for which we have not substituted. Doing so gives r = .2773998.
The assumed value for r was too small. Let us therefore repeat
the process, assuming r = .2774. This gives r = .2773741.
We thus have the following table:


TABLE L

   Assumed r      Resulting r
    .2770          .2773998
    .2774          .2773741


Interpolating between these two pairs of values so as to find
that value starting with which leads to itself as result, we
find r = .2773757. Expressed as an equation this value of
r is given by

\[ \frac{r - .2770}{.2774 - .2770} = \frac{.2773998 - r}{.2773998 - .2773741} \]
The work has been carried to seven figures merely to show the 
method, not because such refinement in calculation is neces- 
sary in order to obtain a three or four figure result. 
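
The substitution procedure just described lends itself to a short program. The sketch below is one possible arrangement and not the only method of solving [207]: it builds the coefficients from the recursion of [208], keeps the first-power term on the left, and substitutes the current value of r in all higher terms until the result repeats. The names `hermite` and `tetrachoric_r` and the parameters `terms` and `iters` are assumptions for illustration; the standard library normal distribution stands in for Table K-W, and the simple iteration is only expected to settle quickly for moderate values of r.

```python
from math import factorial
from statistics import NormalDist

def hermite(n: int, x: float) -> float:
    """v_n from v_n = x*v_(n-1) - (n-1)*v_(n-2), with v_0 = 1 and v_1 = x."""
    v_prev, v = 1.0, x
    if n == 0:
        return v_prev
    for k in range(2, n + 1):
        v_prev, v = v, x * v - (k - 1) * v_prev
    return v

def tetrachoric_r(delta: float, q: float, q_prime: float,
                  terms: int = 12, iters: int = 50) -> float:
    """Solve (delta - q q')/(z z') = sum_n v_(n-1) w_(n-1) r^n / n! for r."""
    nd = NormalDist()
    x, x_p = nd.inv_cdf(1.0 - q), nd.inv_cdf(1.0 - q_prime)
    z, z_p = nd.pdf(x), nd.pdf(x_p)
    lhs = (delta - q * q_prime) / (z * z_p)
    # coef[n-1] is the coefficient of r^n; the coefficient of r itself is 1.
    coef = [hermite(n - 1, x) * hermite(n - 1, x_p) / factorial(n)
            for n in range(1, terms + 1)]
    r = lhs  # first approximation: neglect all higher powers
    for _ in range(iters):
        higher = sum(c * r ** n for n, c in enumerate(coef[1:], start=2))
        r = lhs - higher
    return r

# Proportions of Table XLIX as rounded in the text.
r = tetrachoric_r(delta=0.1025, q=0.4148, q_prime=0.1773)
print(round(r, 4))
```

Carrying more terms of the series than the six used by hand changes the result only in the sixth decimal place for these data.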

It will be noted that for this low correlation an excellent
approximation, r = .2781, to the final answer is obtained by
keeping the first and second power terms only. We thus find the
correlation between being a lieutenant in the medical corps,
as opposed to being one in some other corps, and low
intelligence test standing to be .2774. We desire to know the
probable error of this result. The full formula (Pearson 1900 cor.)
is laborious to use and Pearson (1913 coef.) has given an
equation which constitutes a close approximation to the full
formula. We may give certain preliminary formulas. The first
is Sheppard's:
\[ r = \cos(2\pi\beta) \qquad \text{(Tetrachoric correlation in case both dichotomic lines are the medians)} \tag{209} \]

If the categories (a + b) and (a + c) correspond to positive
deviations in the traits, then the measures represented by the
a cell are (+ +) measures, those by d (- -), those by b
(+ -), and those by c (- +) measures. Furthermore b
must equal c so that $2\beta = \beta + \gamma$, the proportion of unlike
sign pairs. We may call this proportion u and write the
preceding formula

\[ r = \cos(\pi u) \tag{209 a} \]

This very simple formula will give good results if the
dichotomies differ slightly from the medians, but it should hardly
be used if both p and p' are greater than .55, or if one is equal
to .5 and the other greater than .6. The standard error of
tetrachoric r when the dichotomies are at the medians is


\[ \sigma_r = \frac{\pi}{2\sqrt{N}}\sqrt{(1 - r^2)\left[1 - \left(\frac{\sin^{-1} r}{90^\circ}\right)^2\right]} \tag{210} \]

(The standard error of tetrachoric r when dichotomic lines are at
the medians)


In case the true correlation is zero then no matter what the
position of the dichotomic lines

\[ \sigma_r = \frac{\sqrt{pqp'q'}}{zz'\sqrt{N}} \qquad \text{(The standard error of tetrachoric } r \text{ when the real value of } r = .00\text{)} \tag{211} \]

Finally when the true value of r is not zero, and when
dichotomic lines are not at the medians, we have as a close
approximation

\[ \sigma_r = \frac{\sqrt{pqp'q'}}{zz'\sqrt{N}}\sqrt{(1 - r^2)\left[1 - \left(\frac{\sin^{-1} r}{90^\circ}\right)^2\right]} \tag{212} \]

(The general formula for the standard error of tetrachoric r)


In the reference cited are to be found tables of $\sqrt{pq}/z$ and of
the radical function of r, which will expedite the calculation
of the standard error. For the probable error of r we have

\[ \text{P.E.}_r = \frac{\sqrt{pqp'q'}}{zz'\sqrt{N}}\left\{ .6745\sqrt{(1 - r^2)\left[1 - \left(\frac{\sin^{-1} r}{90^\circ}\right)^2\right]} \right\} \tag{213} \]

(General formula for the probable error of tetrachoric r)


The term in braces is tabled herewith.


TABLE LI
Functions Involved in Calculating the Probable Error of Tetrachoric r


   r    FUNCTION OF r       r    FUNCTION OF r       r    FUNCTION OF r
  .00       .674           .60       .492           .80       .327
  .10       .670           .61       .486           .81       .316
  .20       .655           .62       .479           .82       .305
  .25       .645           .63       .472           .83       .294
  .30       .631           .64       .465           .84       .283
  .35       .615           .65       .458           .85       .271
  .40       .597           .66       .450           .86       .259
  .42       .588           .67       .443           .87       .246
  .44       .580           .68       .435           .88       .233
  .46       .570           .69       .427           .89       .220
  .48       .561           .70       .419           .90       .206
  .50       .551           .71       .411           .91       .192
  .51       .545           .72       .402           .92       .177
  .52       .540           .73       .393           .93       .161
  .53       .535           .74       .385           .94       .144
  .54       .529           .75       .376           .95       .127
  .55       .523           .76       .366           .96       .108
  .56       .517           .77       .357           .97       .088
  .57       .511           .78       .347           .98       .066
  .58       .505           .79       .337           .99       .039
  .59       .499                                   1.00       .000


We may use the preceding formula to calculate the probable
error of the correlation between being a first lieutenant in the
medical corps and low Army Alpha standing.

\[ \text{P.E.}_r = \frac{.6378}{\sqrt{5760 \times .9397 \times .6661 \times 1.4656 \times .3158}} = .0156 \]


The item .6378 comes from Table LI; 5760 is the population; 
and the other items come from z/p and z/q columns of Table 
K-W. 
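
The tabled function and the probable error may also be computed directly. The sketch below assumes the function of Table LI to be $.6745\sqrt{(1 - r^2)[1 - (\sin^{-1} r/90^\circ)^2]}$ and the probable error to be that function multiplied by $\sqrt{pqp'q'}/(zz'\sqrt{N})$; the function names are illustrative, and the inputs are the rounded figures of the example.

```python
import math

def pe_function(r: float) -> float:
    """Function of r tabled in Table LI (assumed form, see lead-in)."""
    angle = math.degrees(math.asin(r)) / 90.0   # arcsine as a fraction of 90 degrees
    return 0.6745 * math.sqrt((1.0 - r * r) * (1.0 - angle * angle))

def tetrachoric_pe(r, p, q, p2, q2, z, z2, n):
    """Probable error of tetrachoric r from the proportions and ordinates."""
    return pe_function(r) * math.sqrt(p * q * p2 * q2) / (z * z2 * math.sqrt(n))

print(round(pe_function(0.60), 3))   # compare the Table LI entry for r = .60
pe = tetrachoric_pe(0.2774, 0.5852, 0.4148, 0.8227, 0.1773,
                    0.389809, 0.259914, 5760)
print(round(pe, 4))
```

Multiplying z/p, z/q, z'/p' and z'/q' under the radical, as in the worked example, is the same computation arranged for use with the columns of Table K-W.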


Section 73. CORRELATION IN A FOUR-FOLD POINT SURFACE


In case the categories in a 2 × 2 fold table cannot reasonably
be thought of as indicating different quantitative values of the
variate, but of necessity as being indicative of qualitative
differences, we may consider the distribution to be a point
distribution, i.e., that the p frequencies are all concentrated at a
single point and not spread over an interval, and similarly for
q, p' and q'. It will make no difference what the numerical
value of the difference between the two points of the
distribution is, or in fact whether the value is, in the mathematical
sense, real or imaginary. So we will call the distance between
the p and q points j, and that between p' and q' points k, and
calculate a regular product-moment coefficient of correlation
using formula [93] and taking moments around the intersection
of the p and p' category point values.

\[ r = \frac{\delta jk - (qj)(q'k)}{\sqrt{qj^2 - (qj)^2}\,\sqrt{q'k^2 - (q'k)^2}} = \frac{\delta - qq'}{\sqrt{pq}\,\sqrt{p'q'}} \]

Algebraic transformation enables the writing of this formula
in the form

\[ \phi = \frac{\alpha\delta - \beta\gamma}{\sqrt{pqp'q'}} \tag{214} \]

(Product-moment correlation between two point distributions.
Pearson's $r_{hk}$, or $\phi$; Yule's theoretical value of r.)


Pearson and Heron have called this coefficient the Boas- 
Yulean ¢. For a discussion of it see Boas (Science, May 1, 
1909, page 824), Yule (1912 meth.), and Pearson and Heron 
(1913). This formula may safely be used if the point nature 
of the distribution can be established. It would seem to be 
the appropriate formula in calculating the correlation between 
unit traits; possibly that, for example, between sex and 
albinism. The statistical criteria establishing the point nature 
of the value of a variate are still to be devised. They would 
constitute an important supplement to experimental and bio- 
logical work. Pearson has shown (1900, con.) that raz (in the 
notation of this chapter and of Table K—W this is rsx) is the 
correlation between the means, if measured in terms of the 
standard deviations of their distributions, of two variates of a
2 × 2 fold normal correlation surface and that it is also (1904
theory), $\phi$, the square root of the mean square contingency
of a 2 × 2 fold table without any assumption of normality.

It is necessary to distinguish between riz and rum, of Sec- 
tion 49. This latter was found to equal m2 [formula 118]. 
But since 


it will be seen that only when division of the means by the 
standard deviations has no effect upon the correlation, would 
The = ‘2. This is not the case for continuous variates, so 
that $\phi$ or $r_{hk}$ should not be taken as the correlation between
continuous variates even if they are recorded in a two-category
manner. The coefficient $\phi$ is a product-moment coefficient as
concerns h and k or discrete variables, but with reference to
continuous variables it belongs to group (2) which we will
now consider.


Section 74. MEASURES OF CORRELATION NOT EQUIVALENT TO
THE PRODUCT-MOMENT COEFFICIENT; YULE'S COEFFICIENTS
OF ASSOCIATION AND OF COLLIGATION


Two coefficients developed by Yule may be considered in
connection with $\phi$. Using the same notation they are

\[ Q = \frac{ad - bc}{ad + bc} \qquad \text{(Yule's coefficient of association)} \tag{215} \]

\[ \omega = \frac{\sqrt{ad} - \sqrt{bc}}{\sqrt{ad} + \sqrt{bc}} \qquad \text{(Yule's coefficient of colligation)} \tag{216} \]

Yule (1912) points out that Q is not changed by multiplying
the frequencies in the various categories. Thus the Q's for
the two following tables, the second of which has been obtained
from the first by multiplying the frequencies in the (a + b)
category by ten and those in the (b + d) category by five, are
identical.


    a      b            10a     50b
    c      d              c      5d

Yule claims this as a peculiar advantage of the coefficient, but
for a coefficient to be stable under such violent treatment may
be looked upon as a detriment, as Pearson and Heron (1913)
have shown. The coefficient of colligation has the value that
$\phi$ takes when the 4-fold table is "equalized" and when the
classes are given equal or their "natural" percentages, to
employ the term used by Yule. Thus given the 4-fold


    a      b
    c      d


let us multiply the first row, second row, first column and
second column respectively by the fourth roots of the quantities

\[ \frac{cd}{ab}, \qquad \frac{ab}{cd}, \qquad \frac{bd}{ac}, \qquad \frac{ac}{bd} \]

This gives the "equalized" 4-fold

    √ad    √bc
    √bc    √ad

in which plainly p = q = p' = q' = .5. The correlation $\phi$
may be calculated from this, noting that

\[ \alpha = \delta = \frac{\sqrt{ad}}{2(\sqrt{ad} + \sqrt{bc})}, \qquad \beta = \gamma = \frac{\sqrt{bc}}{2(\sqrt{ad} + \sqrt{bc})} \]

so that

\[ \phi = \frac{ad - bc}{\sqrt{(\sqrt{ad} + \sqrt{bc})^4}} = \frac{\sqrt{ad} - \sqrt{bc}}{\sqrt{ad} + \sqrt{bc}} = \omega \]

Thus Yule's coefficient of colligation constitutes a $\phi$ calculated
from the equalized table. Conditions which would warrant
its use as a measure equivalent to a product-moment coefficient
of correlation are seldom present. They are (a) point
distribution in the traits and (b) warrant for equalization of the
table. Warrant for equalizing may occasionally be present; as for
example, if ten men and 100 women are measured and it is
desired to find the correlation when the population of men
and women are equal, but it is difficult to think of a reasonable
problem in which there would be warrant for equalizing in
the case of both traits. If $\omega$ has peculiar value, not as a
product-moment coefficient, but as some other kind of a
correlation coefficient, its physical meaning is still to be
demonstrated and meanwhile it would seem the part of wisdom to
limit its use to the narrow field in which conditions (a) and (b)
are met. A still narrower range of utility for the association
coefficient Q seems indicated. The great ease with which Q
and $\omega$ can be calculated, as compared with $r_t$ and C, the
contingency coefficient, will tempt one to use them for situations
for which they are not applicable. Yule has derived the
standard error of $\phi$ (see Pearson and Heron, 1913). It is


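\[
\sigma_\phi = \frac{1}{\sqrt{N}}\left\{ 1 - \phi^2 + \left(\phi + \frac{\phi^3}{2}\right)\left[\sqrt{\frac{p}{q}} - \sqrt{\frac{q}{p}}\right]\left[\sqrt{\frac{p'}{q'}} - \sqrt{\frac{q'}{p'}}\right] - \frac{3}{4}\phi^2\left[\frac{(p - q)^2}{pq} + \frac{(p' - q')^2}{p'q'}\right] \right\}^{1/2}
\tag{217}
\]

(Standard error of $\phi$ from a 4-fold table)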


Although $\omega$ is a special case of $\phi$, the multiplication of the
frequencies to obtain the equalized 4-fold table introduces another
factor so that we cannot in general take $\sigma_\omega$ as being equal
to $\sigma_\phi$.

The contingency method developed by Pearson leads to two 
constants. One is P, the probability of a situation as extreme
as that found arising as a matter of chance if the two variables
are in truth uncorrelated; hence if P is small it argues for a 
correlation. The second is C, the coefficient of contingency, 
which under certain conditions is equal to the coefficient of 
correlation which would be obtained from the same data. 
The coefficient of contingency belongs to the first group of 
measures of relationship, but as it is derived in connection 
with P we will consider it here. 


Section 75. MEASURES OF RELATIONSHIP INTERPRETED IN 
TERMS OF PROBABILITY 


The product theorem in probabilities is that if p is the 
probability of occurrence of a certain event and p’ of a second 
unrelated event, then pp’ is the probability of the joint occur- 
rence of the two events. Thus if 30 per cent of a given popula- 
tion have blue eyes and if 50 per cent are males and if eye color 
and sex are uncorrelated, then the likelihood, in making a
random selection, of obtaining a blue-eyed male is .15 (= .30
× .50) or, in the long run, 15 per cent of the random selections
will be blue-eyed males. If, then, a large drawing results in a
proportion sufficiently different from .15 to preclude the pos- 
sibility of chance, the existence of correlation between eye 
color and sex is established. We need to know P, as defined 
in the last paragraph of Section 74, and we desire a measure 
based upon P which is comparable in its general meaning to a 
product-moment coefficient of correlation. Let us be given 
the manifold table. 


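TABLE LII

[Schematic manifold table: the cell frequencies are designated
$n_{ss'}$; the marginal totals of the first variable are designated
$n_s$, those of the second variable $n_{s'}$; the grand total is N.]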


in which $n_s$ is the number of cases in a category of the first
variable, $n_{s'}$ in a category of the second, and $n_{ss'}$ the number
in the cell given by the intersection of the $n_s$ and $n_{s'}$ categories.
There are as many $n_s$ frequencies as there are categories in the
first variable, as many $n_{s'}$ frequencies as categories in the second
variable and as many $n_{ss'}$ frequencies as the product of the
number of categories in the first variable times the number in
the second. If a chance situation maintains, the proportion
of the whole found in a cell will, by the product theorem, be
given by

$$\frac{n_s}{N} \cdot \frac{n_{s'}}{N} \qquad [218]$$


In general this situation will not maintain, so that the actual
number in a compartment minus the chance or theoretical
number measures the divergence of the situation from chance.
This magnitude will be designated by $d_{ss'}$ and will be called
the cell divergence

$$d_{ss'} = n_{ss'} - \frac{n_s\, n_{s'}}{N} \quad \text{(Cell divergence from chance situation)} \quad [219]$$


The cell divergence is the divergence of a cell frequency from
a chance frequency when it is desired to compare the obtained
situation with the uncorrelated or chance situation; but if it
is desired to test out some theoretical cell frequencies ($m_{ss'}$),
then the cell divergence becomes the divergence of an actual
cell frequency from the theoretical frequency, or ($n_{ss'} - m_{ss'}$).
Therefore we have for the general case

$$d_{ss'} = n_{ss'} - m_{ss'} \quad \text{(The cell divergence)} \quad [220]$$

The square of the cell divergence divided by the theoretical
frequency (which is usually the chance frequency) will be
called the cell square contingency, while the sum of all such
cell square contingencies has been termed by Pearson (1900,
crit.) the square contingency, and given the symbol χ². Thus

$$\chi^{2} = S\left(\frac{d_{ss'}^{2}}{m_{ss'}}\right) = S\left(\frac{(n_{ss'} - m_{ss'})^{2}}{m_{ss'}}\right) \quad \text{(The square contingency)} \quad [221]$$

Obviously

$$S\, d_{ss'} = 0.$$
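The definitions of chance frequency, cell divergence and square contingency can be followed step by step in a short computation. This is an illustrative sketch only: the function name and the small table of frequencies are hypothetical, and the chance frequencies are taken from the product theorem as described above.

```python
def square_contingency(table):
    """Return chi-square and the cell divergences for a manifold table.

    Chance (theoretical) frequencies come from the product theorem:
    m = (row total) * (column total) / N.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    divergences = []
    for i, row in enumerate(table):
        d_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            d = observed - expected          # the cell divergence
            d_row.append(d)
            chi2 += d * d / expected         # the cell square contingency
        divergences.append(d_row)
    return chi2, divergences

table = [[30, 20, 10],
         [10, 20, 30]]                       # hypothetical frequencies
chi2, d = square_contingency(table)
total_divergence = sum(sum(row) for row in d)
print(chi2, total_divergence)
```

The divergences sum to zero, as the text notes they must, while their squares, weighted by the chance frequencies, accumulate into χ².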

A measure of total contingency can be built upon the absolute
values of the cell divergences, $|d_{ss'}|$ (Pearson, 1904), but the
measure of square contingency has superior advantages.

The square contingency cannot be directly interpreted 
because two factors are involved in it, the number of cells and 
the strength of the contingency. To eliminate the number of 
cells from consideration, Pearson has given the two equations 


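$$P = \sqrt{\frac{2}{\pi}} \int_{\chi}^{\infty} e^{-\frac{1}{2}\chi^{2}}\, d\chi + \sqrt{\frac{2}{\pi}}\, e^{-\frac{1}{2}\chi^{2}} \left(\frac{\chi}{1} + \frac{\chi^{3}}{1\cdot 3} + \frac{\chi^{5}}{1\cdot 3\cdot 5} + \cdots + \frac{\chi^{n'-3}}{1\cdot 3\cdot 5 \cdots (n'-3)}\right)$$

if n′ be even .......... [222]

$$P = e^{-\frac{1}{2}\chi^{2}} \left(1 + \frac{\chi^{2}}{2} + \frac{\chi^{4}}{2\cdot 4} + \frac{\chi^{6}}{2\cdot 4\cdot 6} + \cdots + \frac{\chi^{n'-3}}{2\cdot 4\cdot 6 \cdots (n'-3)}\right)$$

if n′ be odd .......... [222 a]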


in which n′ is the number of cells and P the probability that
random sampling would lead to as large or larger divergence
between theory and observation. Elderton (1902 tables, and
also, Pearson’s Tables) has tabled P for various numbers of
cells and values of χ². It is thus a simple matter to determine
the probability of a situation as extreme as the one observed
(note that this is not equivalent to saying “the probability of
the observed situation”) arising as a matter of chance. There
is no assumption of normality in the determination of χ², but
in deriving the equation giving P from χ² it is assumed that
the frequencies in a cell resulting from successive samplings
form a normal system of variates. This is entirely different
from the assumption that the categories are classes in reality
constituting a normal distribution. It is because of avoiding
any such assumption that the contingency method has its
chief value. The assumption that, within a single cell, the
results of successive samplings will constitute a normal
distribution of frequencies, may regularly be expected to hold,
provided p, the probability of a measure being in a cell, is not
so small but that $(p + q)^{N}$ can be approximately represented
by a normal distribution. As a practical matter (p N) the
theoretical number of cases in the cell should not be less than
1.00. If the categories are such that the theoretical frequency
in any cell is less than 1.00, two or more categories should be
combined so as to give cells with theoretical frequencies
greater than 1.00. As a very minimum, not to be approached
if avoidable, the smallest theoretical frequencies should not
be less than .7.
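Where tables of P are not at hand, the two series of formulas [222] and [222 a] can be summed directly. The sketch below is an illustration under stated assumptions: the function name is hypothetical, n′ is taken to be the number of cells exactly as the text defines it, and the integral in the even case is evaluated with the complementary error function, to which it is equal.

```python
import math

def pearson_P(chi2, n_cells):
    """Probability P of a divergence as large or larger, by the series for n' cells."""
    if n_cells < 2:
        raise ValueError("at least two cells are required")
    chi = math.sqrt(chi2)
    tail = math.exp(-0.5 * chi2)
    if n_cells % 2 == 1:
        # n' odd: 1 + chi^2/2 + chi^4/(2*4) + ... + chi^(n'-3)/(2*4*...*(n'-3))
        term, total = 1.0, 1.0
        for k in range(2, n_cells - 2, 2):
            term *= chi2 / k
            total += term
        return tail * total
    # n' even: chi/1 + chi^3/(1*3) + ... + chi^(n'-3)/(1*3*...*(n'-3))
    term, total = chi, 0.0
    for k in range(1, n_cells - 2, 2):
        if k > 1:
            term *= chi2 / k
        total += term
    # sqrt(2/pi) * integral from chi to infinity of exp(-x^2/2) dx equals erfc(chi/sqrt(2))
    return math.erfc(chi / math.sqrt(2)) + math.sqrt(2 / math.pi) * tail * total

print(pearson_P(20.0, 6))   # hypothetical chi-square of 20 from a six-cell table
```

A small P argues, as the text says, against the chance hypothesis; the function reports only the probability and leaves the judgment of what is "small" to the reader.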


Section 76. EQUI-PROBABLE r

In case P is very small, its meaning is difficult to interpret.
Pearson (1912 novel) has pointed out that the improbability 
of the obtained 4-fold arising as a matter of chance is equal to 
the improbability of a tetrachoric coefficient of correlation of 
a certain magnitude based upon the same number of cases, 
and Pearson and Bell have provided tables (see Pearson’s 
tables) whereby a P calculated from a 4-fold table may be 
used to determine an equally improbable tetrachoric coeffi- 
cient of correlation. Pearson does not recommend this method 
of interpreting P in case of extreme dichotomies, or in any case 
as being preferable to tetrachoric r. 


Section 77. MEAN SQUARE CONTINGENCY AND COEFFICIENT 
OF CONTINGENCY 


We have obtained a measure of probability, P, from the
square contingency χ². We may also interpret the results by
means of a coefficient of contingency. The most valuable
form is that derived by Pearson which he has called $C_1$ and
which we will here call C. We will first need the mean square
contingency. Designating it by φ² we have

$$\phi^{2} = \frac{\chi^{2}}{N} \quad \text{(Mean square contingency)} \quad [223]$$

The magnitude φ as thus defined is identical in the case of a
4-fold table with φ of formula [214]. As here defined it is
obtained from a manifold of any number of cells. As has been
pointed out in the case of a 4-fold table, φ is not a coefficient
of correlation of a graduated or continuous variate, nor is
the function

$$C = \sqrt{\frac{\phi^{2}}{1 + \phi^{2}}} = \sqrt{\frac{\chi^{2}}{N + \chi^{2}}} \quad \text{(Coefficient of contingency)} \quad [224]$$
but the latter is comparable with it. In fact, if for each vari- 
able the categories are successive values of a graduated variate, 
and if the population is large and the number of categories 
great so that there is not a grouping error, and if the correla- 
tion surface is normal, then C is identical with the product- 
moment coefficient of correlation. 

As a measure of relationship between continuous variates
there are two corrections which should be applied to C, one due
to number of cells and the other a correction for class index.
(Pearson and Heron 1913, page 217.)

If κ = number of rows and λ = the number of columns and
if the frequencies in the categories do not differ one from
another in an extreme manner, the corrected mean square
contingency, ${}_c\phi^{2}$, is given by

$${}_c\phi^{2} = \frac{\chi^{2} - (\kappa - 1)(\lambda - 1)}{N} \quad \text{(Value of } \phi^{2} \text{ corrected for number of cells)} \quad [225]$$


In case broad categories are used there are wide differences in
the measures within a category and these may be differently
grouped for the successive cells of a single category, so that
there is a correction for class index needed (Pearson 1913,
meas., page 130). This correction does not apply to φ which
is, in the case of a 4-fold, the correlation between points and
may be thought of as a similar sort of a function in the case
of a manifold of a greater number of cells; but it does apply
to C, the coefficient of contingency, which aims to measure the
relationship between continuous or graduated variates. Thus
we will consider the uncorrected C as the correlation between
class means and correct by formula [103] where $r_{x\bar{x}}$ and $r_{y\bar{y}}$ have
the meanings defined by formula [102]. The student must
not confuse x of formulas [102], [103], [226] and [226 a] with
the mean square contingency, χ² of formula [224]. They are
entirely unrelated. Applying the correction, we have

$${}_mC = \frac{C}{r_{x\bar{x}}\, r_{y\bar{y}}} \quad \text{(Coefficient of contingency corrected for class means)} \quad [226]$$

We must now obtain values for the correlation between the
variates and the class means, $r_{x\bar{x}}$ and $r_{y\bar{y}}$. The preceding
formula may be written in a form similar to formula [103].

$${}_mC = \frac{C\, \sigma_x \sigma_y}{\sigma_{\bar{x}}\, \sigma_{\bar{y}}} \qquad [226\ a]$$


Note that the assumption of normality implies that the
corrective factor $1/(\sigma_{\bar{x}}\sigma_{\bar{y}})$ is as great for the problem in hand as it
would be were the distribution of the two traits normal. In
other words we assume normality only in the problem of
determining the corrective factor and not in the determining
of C. Wide divergences from normality would probably
amount to very little so far as the corrective factor is
concerned, and as it is necessary to make some assumption in
order to determine this factor we can do no better than assume
normality of distribution. Doing this we find $\sigma_{\bar{x}}$ and $\sigma_{\bar{y}}$ as was
done in Section 47. Should we not wish to make the
assumption of normality we may assume a rectangular distribution
and find the correlation between class means and variates. A
rectangular distribution of κ units length has a standard
deviation of $\sqrt{\kappa^{2}/12}$ and the standard deviation of the means of a
rectangular distribution κ units in length divided into κ equal
intervals is $\sqrt{\kappa(\kappa - 1)/12}$. Thus the corrective factor is
determined from

$$\sqrt{\frac{\kappa(\kappa - 1)/12}{\kappa^{2}/12}} = \sqrt{\frac{\kappa - 1}{\kappa}}$$

(Correlation between class means and variates assuming
a rectangular distribution of κ equal sub-ranges) .....[227]


If κ is the number of categories in the first variable and λ the
number in the second, the total corrective factor is

$$\sqrt{\frac{\kappa\lambda}{(\kappa - 1)(\lambda - 1)}} \qquad [228]$$
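The rectangular case is simple enough to tabulate directly from the two standard deviations just stated. The following sketch uses hypothetical function names; it takes the ratio of the standard deviation of class means to that of the variates exactly as given in the text, and treats the total corrective factor as the reciprocal of the product of the two correlations, which is an assumption about how the factor is applied.

```python
import math

def rectangular_class_mean_correlation(k):
    """Correlation between class means and variates for k equal sub-ranges."""
    sd_variates = math.sqrt(k ** 2 / 12)
    sd_class_means = math.sqrt(k * (k - 1) / 12)
    return sd_class_means / sd_variates

def total_corrective_factor(k, lam):
    """Assumed multiplier of C for k categories in one variable and lam in the other."""
    return 1.0 / (rectangular_class_mean_correlation(k)
                  * rectangular_class_mean_correlation(lam))

for k in (2, 3, 4, 5, 6, 8, 10, 15, 20):
    print(k, round(rectangular_class_mean_correlation(k), 3))
```

The printed values can be set beside the rectangular-distribution column of the table of correlations between class means and variates.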


This correction is larger than the one based upon the
assumption of normality and probably is in general less sound. The
following table is given to show the magnitude of the corrective
factors upon various assumptions and to provide $r_{x\bar{x}}$ when
certain assumptions are reasonable without entailing the
detailed calculation.

TABLE LIII

Value of $r_{x\bar{x}}$, the Correlation between the Class Means and Variates for
Different Groupings

NUMBER OF   EQUAL RANGES.   EQUAL SUB-      EQUAL RANGES.   EQUAL RANGES.
CLASSES     NORMAL          FREQUENCIES.    RECTANGULAR     ANY TYPE OF
            DISTRIBUTION    NORMAL          DISTRIBUTION    FREQUENCY
                            DISTRIBUTION

    2          .798            .798            .707            .589
    3          .872            .891            .816            .842
    4          .923            .928            .866            .915
    5          .949            .947            .894            .946
    6          .964            .959            .913            .963
    8          .979            .972            .935            .979
   10          .986            .979            .949            .987
   15          .993            .988            .966            .994
   20          .996            .992            .975            .997

The values in the last column have been derived upon the 
assumption that a parabola would well represent the frequency 
surface of any three neighboring classes. In the calculation 
of the first and last columns of this table it has been assumed 
that the total range was equal to 5.6 standard deviations which 
would approximately be the case in a normal distribution if 
the total population is 100 (see prob. 1, Chapter V). Pearson 
(1913 inf.) gives a table containing in part similar information 
upon the assumption that the total range equals 6.0 standard 
deviations which is approximately the case if the total popula- 
tion is 185. The corrective factors given in the 1st, 2d, and 
4th columns are nearly equal to each other if the number of 
classes is greater than three, so it makes little difference which 
of these three hypotheses is assumed in determining this cor- 
rective factor. The assumption of a rectangular distribution 
leads to quite different results throughout the entire length 
of the table. 

We have considered two corrections, one the correction of φ²
for number of cells and the second a correction of C for use of
class means instead of variates in the classes. One further
important item is the probable error of the contingency
coefficient. Much study of this point has been made (Blakeman
and Pearson 1906 and Pearson 1915, prob.) and certain of the
methods obtained are involved. The method here given,
derived by Pearson (1915 prob.), is fairly simple, involving the
calculation of but a single additional constant ψ². Let the
cell ψ² function be defined by the equation

$$\text{Cell } \psi^{2} \text{ function} = \frac{(d_{ss'})^{3}}{(m_{ss'})^{2}} \qquad [229]$$

and let ψ² be the sum of such functions for the entire table,
divided by the population, thus

$$\psi^{2} = \frac{1}{N}\, S\left(\frac{(d_{ss'})^{3}}{(m_{ss'})^{2}}\right) \quad (\psi^{2} \text{ function required in finding the probable error of } \phi \text{ and of } C) \quad [230]$$

Having φ² and ψ² we may obtain the standard error of φ from
the formula

$$\sigma_{\phi} = \frac{1}{\sqrt{N}} \left(\frac{\phi^{2} + \psi^{2} - \phi^{4}}{\phi^{2}}\right)^{1/2} \quad \text{(Standard error of } \phi) \quad [231]$$

Further, having C we obtain

$$\sigma_{C} = \frac{1}{\sqrt{N}} \left(\frac{\phi^{2} + \psi^{2} - \phi^{4}}{\phi^{2}\,(1 + \phi^{2})^{3}}\right)^{1/2} \quad \text{(Standard error of the coefficient of mean square contingency)} \quad [232]$$

We may illustrate the calculation of φ, C, $\sigma_C$ and the corrections
to φ and C by the following data taken from the army
psychological findings (Yerkes, 1921, page 825).


TABLE LIV

                BAKER    [illegible]   BARBER   [illegible]   BUTCHER    TOTAL

Tested by       294.       289.         275.      450.          370.      1678
Army            323.5      262.9        321.8     390.9         378.9
Alpha          − 29.5       26.1       − 46.8      59.1         − 8.9
                 2.690      2.591        6.821     8.935          .209
               −  .245       .257      −  .992     1.351        − .005

Tested by        85.        19.         102.        8.           74.       288
Army             55.5       45.1         55.2      67.1          65.1
Beta             29.5     − 26.1         46.8    − 59.1           8.9
                15.680     15.104       39.678    52.054         1.217
                 8.334    − 8.741       33.640  − 45.848          .167

Total           379        308          377       458           444       1966

χ² = 144.979        Nψ² = − 12.082
φ² = .07374         ψ² = − .006145
C = .2621           σ_C = .01861
                    P.E._C = .0126


The 5 entries in each cell are, in order, as follows:

$n_{ss'}$   The frequency found in the cell
$m_{ss'}$   The theoretical cell frequency
$d_{ss'}$   The cell divergence
$(d_{ss'})^{2} / m_{ss'}$   The cell square contingency
$(d_{ss'})^{3} / (m_{ss'})^{2} = \dfrac{d_{ss'}}{m_{ss'}} \cdot \dfrac{(d_{ss'})^{2}}{m_{ss'}}$   The cell ψ² function

It should be noted that the φ² used in the calculation of $\sigma_C$ is
not the corrected value. We may, however, with insignificant
error consider $\sigma_C$ to be either the standard error of the raw or
the corrected coefficient of contingency. The mean square
contingency corrected for too fine grouping, ${}_c\phi^{2}$, is, by formula
[225],

$${}_c\phi^{2} = \phi^{2} - \frac{(\kappa - 1)(\lambda - 1)}{N} = .07374 - \frac{4}{1966} = .07171$$
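The whole calculation for the Alpha and Beta data can be retraced in a few lines. This is a sketch rather than the author's worksheet: the function and key names are hypothetical, the standard error of C is taken in the form $\sigma_C = \sqrt{(\phi^2 + \psi^2 - \phi^4) / (N \phi^2 (1 + \phi^2)^3)}$, which reproduces the printed value, and the chance frequencies are carried at full precision, so the results differ slightly from printed figures that were built up from rounded cell entries.

```python
import math

observed = [
    [294, 289, 275, 450, 370],   # tested by Army Alpha
    [85, 19, 102, 8, 74],        # tested by Army Beta
]

def contingency_summary(table):
    """Chi-square, phi^2, psi^2, C, its standard error, and phi^2 corrected for cells."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    n = sum(rows)
    chi2 = 0.0
    n_psi2 = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            m = rows[i] * cols[j] / n      # theoretical (chance) frequency
            d = obs - m                    # cell divergence
            chi2 += d * d / m              # cell square contingency
            n_psi2 += d ** 3 / m ** 2      # cell psi^2 function
    phi2 = chi2 / n
    psi2 = n_psi2 / n
    c = math.sqrt(phi2 / (1 + phi2))
    sigma_c = math.sqrt((phi2 + psi2 - phi2 ** 2) / (n * phi2 * (1 + phi2) ** 3))
    kappa, lam = len(table), len(table[0])
    phi2_corrected = (chi2 - (kappa - 1) * (lam - 1)) / n
    return {
        "chi2": chi2, "phi2": phi2, "psi2": psi2, "C": c,
        "sigma_C": sigma_c, "PE_C": 0.6745 * sigma_c,
        "phi2_corrected": phi2_corrected,
    }

summary = contingency_summary(observed)
for name, value in summary.items():
    print(name, round(value, 5))
```

Agreement with the printed χ², C, and probable error to within rounding is a useful check that the five entries of each cell have been read correctly.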


The corrected coefficient of contingency depends upon the
correlation between class means and variates. Let $r_{x\bar{x}}$ stand
for this correlation in the case of the test series and let $r_{y\bar{y}}$ be
this correlation for the vocation series. It is very difficult to
make an assumption as to the distribution of the variates
within the vocational categories. However, assuming “equal
ranges any type of frequency” we find from Table LIII that
$r_{y\bar{y}}$ = .946, for a five category series. The assumption of a
normal distribution for the other variable is reasonable though
we cannot expect the most reasonable of assumptions to give
a very reliable corrective factor from a two category
distribution. We have


TABLE LV 
NUMBER PER CENT z ere oa Clise 
Test by Army Alpha 1678 85.36 .257 
.22 
Test by Army Beta 288 14.64 3 — 1.567 


$$\sigma_{\bar{x}} = \sqrt{.8536\,(.257)^{2} + .1464\,(-1.567)^{2}} = .6449$$

and since $\sigma_x$ = 1.00 we have

$$r_{x\bar{x}} = \frac{.6449}{1.00} = .645$$

Thus finally

$${}_mC = \frac{{}_cC}{r_{x\bar{x}}\, r_{y\bar{y}}} = \frac{.2587}{.6449 \times .946} = .424$$

P.E. of ${}_mC$ = approximately .0204 as determined from the
proportion

$$\frac{.2621}{.0126} = \frac{.4240}{\text{P.E. of } {}_mC}$$

This completes the solution, and for the problem in hand we
may conclude that there is a small correlation of .424 between
trades considered and literacy and that this is established
with a very satisfactory degree of certainty.

The reader should note that the corrected value of C differs
materially from the raw value.


Section 78. VARIATE DIFFERENCE METHOD 


The variate difference method was first used by Miss F. E. 
Cave, in 1904, in a study of the correlation of barometric 
heights, published in the proceedings of the Royal Society of 
London, v. 74, pp. 407. The object of this study was to get 
rid of seasonal change by correlating first differences of readings 
as obtained at two stations. Later, Hooker (1905, Jour. of 
the Roy. Soc., v. 68), Student (1914), Anderson (1914), 
Beatrice M. Cave and Pearson (1914) and Ritchie-Scott (1915) 
have further developed the theory and illustrated its use, 
and Persons (1916), (1917) has noted certain of its shortcomings. 
There is still much to be done in establishing its degree of 
applicability to short series such as are usually available in 
material influenced by spurious time and space factors. 

If barometric heights constitute the data and a large number 
of measures are available, there is little doubt but that the 
method will give the correlation between the readings at two 
stations independent of spurious space or time factors; but 
if two series of yearly price indexes, extending over n years,
where n is small, are correlated by the variate difference
method; (a) the probable error of the correlation obtained is
not definitely known, (b) the number of differences which it
is desirable to use is uncertain, and (c) the relation between
the applicability of the method and the size of n is not
established. Cave and Pearson (1914) consider good results to be
obtained by going to fifth or sixth order differences when 
dealing with eleven commercial indexes, each extending over 
28 years, but this point is not indubitably established. The 
problem shortly to be presented to illustrate the method is 
equally extensive in time, but the real relationship between 
the variables, independent of time, can hardly be said to be 
apparent. The treatment of the following sections will be in 
the order, (a) notation, and tests of applicability, [1] by com- 
parison of standard deviations of successive difference series 
and [2] by the stability of the successively obtained correla- 
tions; and (b) illustration by a problem. 

(a) Given two series, $x_1, x_2, \cdots x_n$ and $y_1, y_2, \cdots y_n$ between
which there is an organic correlation, R, and a spurious
correlation due to a time or location factor such that the two
phenomena together result in an apparent, i.e., an obtained
correlation, of r. The problem is to determine R. Student
(1914) has shown that if

$$\begin{aligned} x_1 &= X_1 + b t_1 + c t_1^{2} + d t_1^{3} + \text{etc.} \\ x_2 &= X_2 + b t_2 + c t_2^{2} + d t_2^{3} + \text{etc.} \\ &\ \text{etc.} \end{aligned} \qquad [233]$$

and if

$$\begin{aligned} y_1 &= Y_1 + B t_1 + C t_1^{2} + D t_1^{3} + \text{etc.} \\ y_2 &= Y_2 + B t_2 + C t_2^{2} + D t_2^{3} + \text{etc.} \\ &\ \text{etc.} \end{aligned} \qquad [234]$$

in which $X_1, X_2$, etc., $Y_1, Y_2$, etc., are independent of time or
location, then, if the parabolic equations in t terminate with
some power $t^{n}$, the correlation $r_{XY}$ is given by the correlation
between $\Delta_n$ and $\delta_n$, the two series of n-th order differences,
$\Delta_1$ standing for the measures $(x_1 - x_2), (x_2 - x_3) \ldots (x_{n-1} - x_n)$;
$\Delta_2$ for the measures $[(x_1 - x_2) - (x_2 - x_3)]$,
$[(x_2 - x_3) - (x_3 - x_4)], \ldots [(x_{n-2} - x_{n-1}) - (x_{n-1} - x_n)]$; and
similarly $\Delta_3$ for third order differences; $\Delta_4$ for fourth order
differences; etc.; the δ’s having comparable meanings in the
case of the y-series. Cave and Pearson have noted that in
this equation the ratio

$$\frac{\sigma^{2}_{\Delta_{m+1}}}{\sigma^{2}_{\Delta_m}} = 4 - \frac{2}{m+1}$$
and that, therefore, starting with a series in which measures 
are not independent but influenced by a time factor which 
can be expressed, as suggested, by a terminating parabolic 
series, taking successive differences and calculating the standard 
deviations of the difference series, one should obtain, as soon 
as sufficient differences have been taken to eliminate the 
spurious time factor, standard deviations bearing the ratio 
indicated. This accordingly constitutes a test in a single 
series of the number of differences which are required to eli- 
minate a time or space factor. Cave and Pearson applied this 
test to the eleven series with which they worked, but did not 
succeed in establishing the number of differences necessary to 
eliminate the time factor. They attribute their failure to the 
small period studied. However, 28 years is, as economic 
data run, a fairly long period. Some method, — partial cor- 
relation, variate difference, or what not,—to eliminate an 
annoying time factor, for data covering such or a shorter 
period, is greatly needed. 

The approach of the ratio of successive standard deviations 
of the difference series of the single variable to 4 — 2/(m + 1) 
is the first test of the possibility of eliminating a time or space 
factor by dealing with differences. 
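The first test can be tried on any series before the correlations themselves are computed. The sketch below is illustrative only: the function names and the artificial series (random values plus a smooth trend) are hypothetical, and the variance is taken about the mean of each difference series.

```python
import math
import random
import statistics

def difference(series):
    """First differences (x1 - x2), (x2 - x3), ... of a series."""
    return [a - b for a, b in zip(series, series[1:])]

def theoretical_ratio(m):
    """Expected ratio of squared standard deviations, order m+1 to order m."""
    return 4 - 2 / (m + 1)

def variance_ratios(series, max_order):
    """Observed ratios of successive difference-series variances."""
    ratios = []
    current = list(series)
    var_prev = statistics.pvariance(current)
    for _ in range(max_order):
        current = difference(current)
        var_next = statistics.pvariance(current)
        ratios.append(var_next / var_prev)
        var_prev = var_next
    return ratios

random.seed(0)
noise = [random.gauss(0, 1) for _ in range(200)]
series = [e + 0.5 * t + 0.01 * t * t for t, e in enumerate(noise)]  # noise plus trend
observed = variance_ratios(series, 4)
for m, ratio in enumerate(observed):
    print(m, round(ratio, 3), round(theoretical_ratio(m), 3))
```

For purely random values the variance of the m-th differences is proportional to the binomial coefficient C(2m, m), which is where the limit 4 − 2/(m + 1) comes from; the early ratios of a trended series fall short of it until the trend has been differenced away.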

The second test lies in the stability of successive correlations
between differences, of equal order, of the two series. Thus,
if $r_{xy} \neq r_{\Delta_1\delta_1} \neq r_{\Delta_2\delta_2}$, but, very approximately,
$r_{\Delta_2\delta_2} = r_{\Delta_3\delta_3} = r_{\Delta_4\delta_4}$, it would be concluded that the time or space factor
had been eliminated by the resort to second differences and that
the correlation then found, $r_{\Delta_2\delta_2}$, was in truth $r_{XY}$, the desired
correlation between the two traits independent of the spurious
element.

The data in Table LVI, p. 274, kindly supplied by Mr.
Willis H. Rich, have all the characteristics expected in series to
be treated by this method. That the conclusions will be found
to be somewhat doubtful points to the weakness of the method in
its present state of development.


TABLE LVI
Chinook Salmon, Columbia River

DATE OF PACK    PACK IN 1000'S    HATCHERY      FRY LIBERATED
                OF CASES          OUTPUT        IN SPRING OF

    1889            265
    1890            335
    1891            353
    1892            344
    1893            288           [illegible]       1890
    1894            351             4.90            1891
    1895            444             1.33            1892
    1896            370             4.10            1893
    1897            442              .21            1894
    1898            346              .00            1895
    1899            286             3.39            1896
    1900            294             6.59            1897
    1901            334 (1)        21.94            1898
    1902            375            12.87            1899
    1903            469            11.00            1900
    1904            547            10.04            1901
    1905            572            24.10            1902
    1906            511            20.44            1903
    1907            410            23.56            1904
    1908            334             9.15            1905
    1909            300            17.03            1906
    1910            442             9.10            1907
    1911            609            16.44            1908
    1912            365            15.43            1909
    1913            335            12.54            1910
    1914            419            13.97 (2)        1911
    1915            508            15.41            1912
    1916            511            26.10            1913
    1917            450            41.58            1914
    1918            445            44.45            1915
    1919            475            53.24            1916
    1920            477            25.03            1917
                                   56.80            1918
                                   22.57            1919
                                   25.00 (3)        1920

(1) 334 is an estimate based upon the total pack for the year.
(2) 13.97 is an estimate based on the total hatchery output.
(3) 25.00 is a sheer estimate.




The problem is to ascertain if there is positive correlation 
between the number of fry liberated from the hatcheries and 
the run of salmon later, particularly three years later, when the 
fry are grown and return to spawn. It is known that the 
salmon returns to the same river in which liberated and that 
roughly 8 per cent (these 8 per cent are small fish and would 
be equivalent to some 5 per cent in weight of pack) return to 
spawn one year after liberation, 20 per cent (or 15 per cent of 
the pack) return in two years, 50 per cent (or 50 per cent of 
the pack) return in three years, 20 per cent (or 25 per cent of 
the pack) return in four years, and 2 per cent (or 5 per cent of 
the pack) return in five years. Accordingly if there is positive 
correlation between number of fry liberated and size of pack 
independent of time, it should be greatest when correlating 
size of pack with number of fry liberated three years earlier. 

The means and squared standard deviations are given in the 
first two columns of accompanying Table LVII. 


TABLE LVII

        MEANS     STANDARD DEVIATIONS    RATIOS                      4 − 2/(m+1)   RATIO OF
                  SQUARED                σ²(m+1) / σ²(m)                           RATIOS

x       418.18         7,731.14            .971                        2.000         .486
Δ₁      − 7.00         7,505.19           1.939                        3.000         .646
Δ₂      − 2.35        14,552.54           2.660                        3.333         .798
Δ₃       62.35        38,704.94           3.077                        3.500         .879
Δ₄        5.58       [illegible]          3.267                        3.600         .908
Δ₅       24.00       389,050.3           [illegible]                   3.667        [illegible]
Δ₆       56.18     1,294,512.40


The last column of the table shows the approach of the ratios
of the standard deviations squared to a random situation, i.e.,
a situation from which the time or space factor has been
eliminated. There is seen to be some approach to the value
4 − 2/(m + 1), but the approach is not sufficiently close to
say that this test supports the contention that a resort to
fourth, fifth, or sixth differences frees the data of the spurious
factor.

More promising results are obtained from the “hatchery
output” data. Keeping the data to the nearest .1 and shifting
the decimal point one place to the right, they yield


TABLE LVIII

        MEANS     STANDARD DEVIATIONS    RATIOS                      4 − 2/(m+1)   RATIO OF
                  SQUARED                σ²(m+1) / σ²(m)                           RATIOS

y     [illegible]     17,131.96            .480                        2.000         .240
δ₁      − 8.22         8,231.07           2.127                        3.000         .709
δ₂     − 11.65        17,507.46           2.900                        3.333         .870
δ₃       12.52        50,779.33           3.525                        3.500        1.007
δ₄     − 22.96       178,983.22           3.767                        3.600        1.046
δ₅       15.87       674,219.57           3.856                        3.667        1.052
δ₆     − 66.64     2,599,756.51


We may conclude that so far as this test permits us to form a 
judgment we will succeed in eliminating the spurious factor by 
resorting to fourth or higher differences. 

Calculation of the product-moment coefficients of correlation 
between similar difference series gives the values recorded in 
the following table: 


TABLE LIX

r_xy    =   .3802 ± .1275
r_Δ₁δ₁  =   .0003 ± .1580
r_Δ₂δ₂  =   .0145 ± .1826
r_Δ₃δ₃  = − .0258 ± .2023
r_Δ₄δ₄  = − .0247 ± .2196
r_Δ₅δ₅  = − .0005 ± .2354
r_Δ₆δ₆  =   .0525 ± .2503

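The probable errors have been calculated by the following
formulas, which are due to Anderson (1914): Let $r_{XY}$ be, as
before, the correlation between the two variables independent of
the time or location factor; let $\sigma_{00}$ be the standard error of $r_{xy}$;
$\sigma_{11}$ the standard error of $r_{\Delta_1\delta_1}$; $\sigma_{22}$ the standard error of $r_{\Delta_2\delta_2}$, etc.
Then,

$$\sigma_{00} = \frac{1 - r_{XY}^{2}}{\sqrt{N}}$$

(Standard errors of variate difference
correlation coefficients) ......... [236]

$$\sigma_{11} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 1}} \sqrt{\frac{3N - 4}{2\,(N - 1)}}$$

$$\sigma_{22} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 2}} \sqrt{\frac{35N - 88}{18\,(N - 2)}}$$

$$\sigma_{33} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 3}} \sqrt{\frac{231N - 843}{100\,(N - 3)}}$$

$$\sigma_{44} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 4}} \sqrt{\frac{1287N - 6128}{490\,(N - 4)}}$$

$$\sigma_{55} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 5}} \sqrt{\frac{46189N - 270635}{15876\,(N - 5)}}$$

$$\sigma_{66} = \frac{1 - r_{XY}^{2}}{\sqrt{N - 6}} \sqrt{\frac{676039N - 4696566}{213444\,(N - 6)}}$$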


ee gana ae 


Sp _2e=) 7 
aN = bia) Pa 


k (k — 1) (Rk — 2) 4 
a ea Olean Pee a) 


The N throughout the formulas is the original population and
not the reduced number of differences. The final correlation,
$r_{XY}$, which maintains after elimination of the spurious factor,
enters into all of these formulas. This correlation is of
course not known, but if successive difference correlations
remain approximately equal one may take this constant
value as the value of $r_{XY}$ and determine approximate
probable errors. For the problem in hand we see that the first,
second, third, fourth, fifth and sixth difference correlations are
closely equal to zero. Accordingly, taking zero as the value of
$r_{XY}$ and using formula [236] we obtain the probable errors
listed. Note that the standard error of $r_{xy}$ is given as
$(1 - r_{XY}^{2})/\sqrt{N}$ and not the usual value $(1 - r_{xy}^{2})/\sqrt{N}$.
That is to say, $r_{xy}$, could it be assumed to be a measure of $r_{XY}$,
has the standard error $(1 - r_{XY}^{2})/\sqrt{N}$, but as a measure not
distinct from the space or time factor it has the usual standard
error. In our present problem, since the ratio of $r_{xy}$ to the
difference correlations does not approximately = 1.00 we should
not assume it to be a measure of $r_{XY}$.
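The probable errors listed with the difference correlations can be recomputed from the constants in formula [236]. The sketch below rests on two assumptions that should be checked against the source: that the population is N = 28 (the value implied by the printed probable error of .1275 for $r_{xy}$ when $r_{XY}$ is taken as zero), and that each standard error has the form $(1 - r_{XY}^2)\sqrt{(A N - B)/D}\,/(N - k)$ with the constants A, B, D as read from the formulas. The names are hypothetical.

```python
import math

# (A, B, D) for difference order k, as read from formula [236]:
# sigma_kk = (1 - r^2) * sqrt((A*N - B) / D) / (N - k)
ANDERSON_CONSTANTS = {
    1: (3, 4, 2),
    2: (35, 88, 18),
    3: (231, 843, 100),
    4: (1287, 6128, 490),
    5: (46189, 270635, 15876),
    6: (676039, 4696566, 213444),
}

def standard_error(k, n, r_true=0.0):
    """Standard error of the k-th order difference correlation (k = 0 is r_xy itself)."""
    base = 1.0 - r_true ** 2
    if k == 0:
        return base / math.sqrt(n)
    a, b, d = ANDERSON_CONSTANTS[k]
    return base * math.sqrt((a * n - b) / d) / (n - k)

def probable_error(k, n, r_true=0.0):
    return 0.6745 * standard_error(k, n, r_true)

N = 28  # assumed: number of paired years in the salmon problem
for k in range(7):
    print(k, round(probable_error(k, N), 4))
```

The steady growth of the probable error with the order of difference is the weakness the text goes on to remark upon: each further differencing costs precision that a short series can ill afford.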

The conclusion which this treatment suggests is that there 
is no relation between planting of fry and run of salmon three 
years later, but this is in no sense established, due to the large 
probable errors. It is of course unfortunate that, with the
very type of data for which large populations cannot be secured,
the probable errors should be larger than for straight
correlations. This is a weakness of the method in the field for which
it would otherwise be most serviceable.

It would be valuable to compare at length results obtained 
by the variate difference method with those from a partial 
correlation or partial correlation ratio method. The data in 
hand do not warrant too detailed an analysis, but it may be 
stated that, assuming either a rectilinear or a single flexion 
curvilinear regression line between time and each of the other 
two variables, the partial correlation between number of fry 
liberated and run three years later is positive and slightly 
greater than its probable error. Thus, for these data, the 
two methods do not point in the same direction. 

Calculating variate difference correlation coefficients between
number of fry liberated and run two, and again four, years
later yields results as inconclusive as those reported.


CHAPTER XI 
MULTIPLE CORRELATION 


Section 79. THE PROBLEM


The fundamental problem of multiple correlation is the
estimation, with minimal error, of one variable knowing several
others. Thus if $X_0$ is the dependent variable, or the one to
be estimated, and $X_1, X_2, \cdots X_n$ the independent variables,
and if $\bar{X}_0$ is the value of the dependent variable as estimated
from the known $X_1, X_2, \cdots X_n$ variables, we may write

$$\bar{X}_0 = f(X_1, X_2, \cdots X_n)$$

and we will say that that function which makes

$$\frac{\Sigma (X_0 - \bar{X}_0)^{2}}{N} = \text{a minimum} \qquad [237]$$

is the best function. Since $(X_0 - \bar{X}_0)$ is an error of estimate,
this is identical with imposing the condition that the sum of
the squares of the errors of estimate shall be a minimum.
Just as we have found that there are many methods of measur- 
ing correlation, so there are many ways of measuring multiple 
correlation. The five following are important, but not inclu- 
sive of all possible methods. 

(a) When f(X1 X2:++Xn) is a linear function of the variables 
we have the usual multiple correlation problem, and the 
method to be used is both the simplest and the most readily 
interpreted. 

(b) When f is a known, but non-rectilinear function of the 
X’s, appropriate transformations as suggested in Section 52 
will ordinarily enable the treatment of this problem by methods 
applicable to (a). 

The complete problem of simple or multiple correlation 
involves, as has been stated, (1) a measure of the strength of 



relationship between a dependent variable and one or more 
independent variables, and also (2) an algebraic means of 
estimating the dependent variable knowing the independent 
variable, or variables. Whereas methods (a) and (b) preceding 
give solutions of both (1) and (2), methods (c), (d) and (e) fol- 
lowing provide a solution of (1) only. 

(c) A multiple and partial correlation ratio method enabling 
an estimation of the magnitudes of the multiple and partial 
correlations between graduated variables which are not related 
to each other by means of rectilinear regression lines. Also, a 

(d) Multiple and partial contingency method accomplishing 
the same result as multiple and partial correlation ratios, and 
particularly applicable to data recorded in a categorical manner. 
This method also leads to interpretation in terms of probability. 

(e) The variate difference correlation method. This method 
is of service when a time or space factor not showing rectilinear 
relation with the other two variables involved hides or clouds 
the partial relationships between the two variables. This 
method has been presented in the preceding section and is very 
different from (a), (b) and (d). The treatment of the next 
five sections is confined to method (a) and covers the 3 or 4 
variable problem in Sections 80, 81, 82, the 4, 5, or 6-variable 
problem in Section 83, and the many variable problem in 
Section 84. 


Section 80. THEORETICAL TREATMENT — 3 VARIABLES 


A simple three-variable problem, so chosen that the interpretation
is not complicated by unequal variabilities of the three
series, will show the concrete and tangible significance of the
partial and multiple correlation coefficients.

We shall use the following notation. 

$X$ = a gross score.
$x = X - M$ = a score as a deviation from the mean.
$\sigma = \sigma_x = \sigma_X$ = the standard deviation of either the $x$'s
or the $X$'s.
$z = \dfrac{x}{\sigma} = \dfrac{X - M}{\sigma}$ = a standard measure.
$\sigma_z = \sqrt{\dfrac{\Sigma z^2}{N}} = 1.0$




Symbols with subscript zero, as $X_0$, $x_0$, $\sigma_0$, $z_0$, designate the
criterion or dependent variable. Symbols with subscript 1
designate the first independent variable, with subscript 2 the
second independent variable. The following symbols with
superior bars, $\bar{X}_0$, $\bar{x}_0$, $\bar{z}_0$, designate gross criterion scores
estimated from a knowledge of the independent variables, deviation
scores estimated from such a knowledge, and standard
scores estimated from such a knowledge, respectively. The
statistical problem is to determine the two constants $\beta_{01.2}$
and $\beta_{02.1}$ (the significance of the subscripts is explained later)
in the equation

$$\bar{z}_0 = \beta_{01.2} z_1 + \beta_{02.1} z_2 \qquad [238]$$
(Fundamental regression equation connecting standard measures, 3 variables)

so that the standard error of estimate $k_{0.12}$ is a minimum.

$$(z_0 - \bar{z}_0) \qquad [239]$$
(Error of estimate or residual of a standard criterion measure)

is the difference between the actual standard criterion score and
the criterion standard score estimated from the independent
variables. It is thus an error of estimate and the standard
error of estimate is

$$k_{0.12} = \sqrt{\frac{\Sigma (z_0 - \bar{z}_0)^2}{N}} \qquad [240]$$
(Standard error of estimate of the standard criterion measures)


If $z_1$ and $z_2$ are worthless in shedding light upon the value of $z_0$
then $\beta_{01.2}$ and $\beta_{02.1}$, the weights appropriate to the $z$'s, will be
zero, and $\bar{z}_0$ will equal zero for every individual. In this case

$$k_{0.12} = \sigma_{z_0} = 1.0$$

This is the maximum value that $k$ can ever take and means
that the error of estimate has not been reduced at all by the
use of $z_1$ and $z_2$ over what it would be were sheer random guesses
resorted to. If $z_0$ can be perfectly estimated from $z_1$ and $z_2$
then every $(z_0 - \bar{z}_0)$ equals zero and $k_{0.12} = .00$. This is the
minimum value that $k$ can take and corresponds to perfect
estimation, or zero errors of estimate throughout. In the
symbol $k_{0.12}$ the subscript before the point designates the
variable estimated and the subscripts after the point designate
the variables from which the estimate has been made. The
problem, as stated, is to determine the $\beta$'s so that $k$ shall be
a minimum. The constant $k_{0.12}$ is the standard deviation of
the errors of estimate when scores are expressed in terms of
standard measures. Its meaning is thus easily grasped and
obviously very important, for the magnitude of the error involved
in estimating one variable knowing all the others is the
first item of information needed in interpreting the significance
of the relation between variables. It will later be shown that
$k_{0.12}$ varies directly as $\sigma_{0.12}$, the standard error of estimate of
the $x_0$'s, or the $X_0$'s, so that establishing the minimal error
condition with reference to the standard measures also establishes
it with reference to the gross scores.

The following derivation of the values of the $\beta$'s is brief and
simple, but involves an understanding of calculus. For those
unfamiliar with calculus a numerical illustration showing the
concrete significance of the constants involved is given in the
next section.

It is required so to choose $\beta_{01.2}$ and $\beta_{02.1}$ that the standard
error of estimate shall be a minimum; that is,

$$\Sigma (z_0 - \bar{z}_0)^2 = \Sigma (z_0 - \beta_{01.2} z_1 - \beta_{02.1} z_2)^2$$

is to be a minimum. Differentiating first with respect to
$\beta_{01.2}$, and second with respect to $\beta_{02.1}$, gives the two following
equations

$$\Sigma\, 2 (z_0 - \beta_{01.2} z_1 - \beta_{02.1} z_2)(-z_1) = 0$$
$$\Sigma\, 2 (z_0 - \beta_{01.2} z_1 - \beta_{02.1} z_2)(-z_2) = 0$$

Dividing by $-2N$, summing the several parts, and remembering
that

$$\frac{\Sigma z_1^2}{N} = \frac{\Sigma z_2^2}{N} = 1.0$$

that

$$\frac{\Sigma z_0 z_1}{N} = r_{01}$$

that

$$\frac{\Sigma z_0 z_2}{N} = r_{02}$$

and that

$$\frac{\Sigma z_1 z_2}{N} = r_{12}$$

we obtain

$$r_{01} - \beta_{01.2} - r_{12}\beta_{02.1} = 0$$
$$r_{02} - r_{12}\beta_{01.2} - \beta_{02.1} = 0 \qquad [241]$$
(Normal equations)


Solving simultaneously

$$\beta_{01.2} = \frac{r_{01} - r_{02} r_{12}}{1 - r_{12}^2}$$
$$\beta_{02.1} = \frac{r_{02} - r_{01} r_{12}}{1 - r_{12}^2} \qquad [242]$$
(Regression coefficients between standard measures, 3 variables)
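
As a check on formula [242], the following minimal Python sketch computes the two $\beta$ weights from the three total correlations and confirms that they satisfy the normal equations [241]. The function names are illustrative only, and the correlations are those of the numerical illustration in Section 81.

```python
def beta_weights(r01, r02, r12):
    """Formula [242]: standard-measure regression weights for 3 variables."""
    denom = 1.0 - r12 ** 2
    if denom <= 0.0:
        raise ValueError("r12 must lie strictly between -1 and +1")
    beta01_2 = (r01 - r02 * r12) / denom
    beta02_1 = (r02 - r01 * r12) / denom
    return beta01_2, beta02_1


def normal_equation_residuals(r01, r02, r12, beta01_2, beta02_1):
    """Left-hand sides of the normal equations [241]; both should vanish."""
    first = r01 - beta01_2 - r12 * beta02_1
    second = r02 - r12 * beta01_2 - beta02_1
    return first, second


# Total correlations of the Section 81 illustration (product sums over N = 12).
r01, r02, r12 = 7.625 / 12, 5.9375 / 12, -3.0625 / 12
beta01_2, beta02_1 = beta_weights(r01, r02, r12)
residuals = normal_equation_residuals(r01, r02, r12, beta01_2, beta02_1)
print(f"beta01.2 = {beta01_2:.5f}, beta02.1 = {beta02_1:.5f}")
print("normal-equation residuals:", residuals)
```

Both residuals should be zero to within rounding, which is all the simultaneous solution asserts.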


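This completes the solution of the 3-variable regression equation
involving standard measures. We will make the usual
transformations,

$$z = \frac{X - M}{\sigma}$$

and express the result in terms of gross scores, giving

$$\frac{\bar{X}_0 - M_0}{\sigma_0} = \beta_{01.2}\left(\frac{X_1 - M_1}{\sigma_1}\right) + \beta_{02.1}\left(\frac{X_2 - M_2}{\sigma_2}\right)$$

which, upon simplification, becomes

$$\bar{X}_0 = \beta_{01.2}\frac{\sigma_0}{\sigma_1} X_1 + \beta_{02.1}\frac{\sigma_0}{\sigma_2} X_2 + \left(M_0 - \beta_{01.2}\frac{\sigma_0}{\sigma_1} M_1 - \beta_{02.1}\frac{\sigma_0}{\sigma_2} M_2\right) \qquad [243]$$

Defining $b_{01.2}$, $b_{02.1}$, and $c$ by the following equations

$$b_{01.2} = \beta_{01.2}\frac{\sigma_0}{\sigma_1}, \qquad b_{02.1} = \beta_{02.1}\frac{\sigma_0}{\sigma_2} \qquad [244]$$

$$c = M_0 - b_{01.2} M_1 - b_{02.1} M_2 \qquad [245]$$

equation [243] may be written

$$\bar{X}_0 = b_{01.2} X_1 + b_{02.1} X_2 + c \qquad [246]$$
(Regression equation involving gross scores, 3 variables)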


Very simple algebraic derivation will show that in the case
of $n$ independent variables we have

$$b_{01.23\cdots n} = \beta_{01.23\cdots n}\frac{\sigma_0}{\sigma_1}$$
$$b_{02.13\cdots n} = \beta_{02.13\cdots n}\frac{\sigma_0}{\sigma_2}, \text{ etc.} \qquad [247]$$

in which $\beta_{01.23\cdots n}$, $\beta_{02.13\cdots n}$, etc., are defined by formula [264 b]

$$c = M_0 - b_{01.23\cdots n} M_1 - b_{02.13\cdots n} M_2 - \cdots - b_{0n.12\cdots n-1} M_n \qquad [248]$$

$$\bar{X}_0 = b_{01.23\cdots n} X_1 + b_{02.13\cdots n} X_2 + \cdots + b_{0n.12\cdots n-1} X_n + c \qquad [249]$$


Equation [246] is ordinarily the most convenient form to use.
The constants $b_{01.2}$, $b_{02.1}$ and $c$ have numerical values which do
not change for the entire population, and it only remains to
substitute the gross scores, $X_1$ and $X_2$, to secure an estimate
of the dependent gross score, $\bar{X}_0$.
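
The passage from standard measures to gross scores, formulas [244] to [246], may be sketched as follows. The means, standard deviations and $\beta$ weights below are hypothetical values chosen only to make the arithmetic easy to follow, and the function names are ours.

```python
def gross_score_constants(beta01_2, beta02_1, means, sds):
    """Formulas [244] and [245]: convert beta weights to b weights and c."""
    m0, m1, m2 = means
    s0, s1, s2 = sds
    b01_2 = beta01_2 * s0 / s1
    b02_1 = beta02_1 * s0 / s2
    c = m0 - b01_2 * m1 - b02_1 * m2
    return b01_2, b02_1, c


def estimate_gross_score(x1, x2, b01_2, b02_1, c):
    """Formula [246]: estimated gross criterion score."""
    return b01_2 * x1 + b02_1 * x2 + c


# Hypothetical values, not taken from the text.
means = (50.0, 40.0, 30.0)   # M0, M1, M2
sds = (10.0, 8.0, 5.0)       # sigma0, sigma1, sigma2
beta01_2, beta02_1 = 0.5, 0.3

b01_2, b02_1, c = gross_score_constants(beta01_2, beta02_1, means, sds)
at_means = estimate_gross_score(means[1], means[2], b01_2, b02_1, c)
one_sd_up = estimate_gross_score(means[1] + sds[1], means[2], b01_2, b02_1, c)
print(b01_2, b02_1, c, at_means, one_sd_up)
```

An individual standing at the mean of both independent variables is estimated at $M_0$, and a rise of one standard deviation in $X_1$ alone raises the estimate by $\beta_{01.2}\sigma_0$, which is what the two forms of the equation have in common.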




We have determined the value of $\beta_{01.2}$ in terms of total correlation
coefficients $r_{01}$, $r_{02}$, and $r_{12}$, and its use in the regression
equation, but have still to discover the property which has led
to the subscript notation. Let us find the regression of that
part of $z_0$ which is independent of $z_2$ upon that part of $z_1$ which
is independent of $z_2$. Since the regression equation connecting
$z_0$ with $z_2$ is

$$\bar{z}_0 = r_{02} z_2$$

that part of $z_0$ which cannot be estimated from a knowledge
of $z_2$, or that part which is independent of $z_2$, is $(z_0 - r_{02} z_2)$.
This magnitude we will designate by $z_{0.2}$, which may be read
"the residual in $z_0$ after estimation of $z_0$ by aid of $z_2$" or "that
part of $z_0$ which is independent of $z_2$."

$$z_{0.2} = (z_0 - r_{02} z_2) \qquad [239 a]$$
(An error of estimate, i.e., a residual)

Obviously the $N$ residuals, $z_{0.2}$, cannot be estimated at all by
means of $z_2$, since $z_2$ has already been used for all that it avails.
This is merely equivalent to saying that the regression of $z_{0.2}$
upon $z_2$, which we write $b_{0.2,\,2}$, is equal to zero. The proof is simple:

$$\Sigma z_{0.2} z_2 = \Sigma (z_0 - r_{02} z_2) z_2 = \Sigma z_0 z_2 - r_{02} \Sigma z_2 z_2 = N r_{02} - N r_{02} = 0$$

accordingly $b_{0.2,\,2} = 0$. We may, however, estimate these residuals
by means of variable 1, which is a new source of data. Since
$z_{0.2}$ has zero regression upon $z_2$, it of course has zero regression
upon that part of $z_1$ which can be estimated by means of $z_2$.
To estimate $z_1$ from $z_2$ we have $\bar{z}_1 = r_{12} z_2$, so that

$$\frac{\Sigma (z_{0.2})(r_{12} z_2)}{\Sigma (r_{12} z_2)^2} = \frac{r_{12}\,\Sigma z_{0.2} z_2}{\Sigma (r_{12} z_2)^2} = 0$$

It is therefore clear that only $z_{1.2}\,(= z_1 - r_{12} z_2)$, that part of $z_1$
which is independent of $z_2$, is of service in estimating $z_{0.2}$, that
part of $z_0$ which is independent of $z_2$. The regression of $z_{0.2}$
upon $z_{1.2}$ is

$$\frac{\Sigma z_{0.2} z_{1.2}}{\Sigma z_{1.2}^2} = \frac{\Sigma (z_0 - r_{02} z_2)(z_1 - r_{12} z_2)}{\Sigma (z_1 - r_{12} z_2)^2} = \frac{r_{01} - r_{02} r_{12} - r_{02} r_{12} + r_{02} r_{12}}{1 - 2 r_{12}^2 + r_{12}^2} = \frac{r_{01} - r_{02} r_{12}}{1 - r_{12}^2} = \beta_{01.2}$$


We now see the meaning of the notation $\beta_{01.2}$. It is the regression
of that part of $z_0$ which is independent of $z_2$ upon that
part of $z_1$ which is independent of $z_2$. For this reason $\beta_{01.2}$ is
called a partial regression coefficient and, to recapitulate, it
has the two following important properties:

(a) It is the regression of that part of $z_0$ which is independent
of $z_2$ upon that part of $z_1$ which is independent of $z_2$.

(b) It is the weight or multiplying factor of $z_1$ when $z_1$ and $z_2$
are both used to estimate $z_0$.

Of course $\beta_{02.1}$ is the comparable partial regression coefficient
when variables $z_1$ and $z_2$ are interchanged. We will now illustrate
this by a numerical example.


Section 81. THREE-VARIABLE PROBLEM ILLUSTRATING 
MEANINGS OF CONSTANTS 


The first three columns of Table LX constitute the series
to be correlated and the subsequent columns are derived
calculations.

TABLE LX

 $z_0$ | $z_1$ | $z_2$ | $r_{02}z_2$ | $z_{0.2}$ | $r_{12}z_2$ | $z_{1.2}$ | $\beta_{01.2}z_{1.2}$ | $z_{0.12}$ | $\bar{z}_0$
  1.75 |  1.00 |   .25 |  .1237 |  1.6263 | -.0638 |  1.0638 |   .8667 |  .7596 |   .9904
  1.25 |   .25 |  1.00 |  .4948 |   .7552 | -.2552 |   .5052 |   .4116 |  .3436 |   .9064
  1.00 |   .00 |  1.00 |  .4948 |   .5052 | -.2552 |   .2552 |   .2079 |  .2973 |   .7027
   .75 |  1.50 |   .00 |  .0000 |   .7500 |  .0000 |  1.5000 |  1.2221 | -.4721 |  1.2221
   .25 |  -.75 |  2.00 |  .9896 |  -.7396 | -.5104 |  -.2396 |  -.1952 | -.5444 |   .7944
   .25 |  1.25 |  -.50 | -.2474 |   .4974 |  .1276 |  1.1224 |   .9145 | -.4171 |   .6671
  -.25 |   .75 | -1.25 | -.6185 |   .3685 |  .3190 |   .4310 |   .3512 |  .0173 |  -.2673
  -.50 | -1.00 |   .00 |  .0000 |  -.5000 |  .0000 | -1.0000 |  -.8148 |  .3148 |  -.8148
  -.75 |   .00 | -1.00 | -.4948 |  -.2552 |  .2552 |  -.2552 |  -.2079 | -.0473 |  -.7027
 -1.00 | -1.00 |   .00 |  .0000 | -1.0000 |  .0000 | -1.0000 |  -.8148 | -.1852 |  -.8148
 -1.25 |   .00 | -1.75 | -.8659 |  -.3841 |  .4466 |  -.4466 |  -.3639 | -.0202 | -1.2298
 -1.50 | -2.00 |   .25 |  .1237 | -1.6237 | -.0638 | -1.9362 | -1.5775 | -.0462 | -1.4538

$\Sigma z_0 z_1 = 7.6250$, $r_{01} = .63542$
$\Sigma z_0 z_2 = 5.9375$, $r_{02} = .49479$
$\Sigma z_1 z_2 = -3.0625$, $r_{12} = -.25521$

$\beta_{01.2} = \dfrac{r_{01} - r_{02} r_{12}}{1 - r_{12}^2} = .81476$

$\beta_{02.1} = \dfrac{r_{02} - r_{01} r_{12}}{1 - r_{12}^2} = .70272$

$\bar{z}_0 = \beta_{01.2} z_1 + \beta_{02.1} z_2 = .81476\, z_1 + .70272\, z_2$

$\Sigma z_{0.2} z_{1.2} = 9.14030$
$\Sigma z_{1.2}^2 = 11.21842$

$\beta_{01.2} = \dfrac{9.14030}{11.21842} = .81476$

$\Sigma z_{0.12}^2 = 1.61504$

$k_{0.12} = \sigma_{0.12} = \sqrt{\dfrac{1.61504}{12}} = .36686$


These series have been so chosen that the means equal zero
and the standard deviations equal one. We are thus dealing
with standard measures, or $z$'s, and not with $x$'s or $X$'s.
Straightforward calculation gives

$$r_{01} = \frac{\Sigma z_0 z_1}{N} = \frac{7.625}{12} = .63542$$
$$r_{02} = .49479$$
$$r_{12} = -.25521$$

We can estimate $z_0$ by means of $z_2$ by the following equation:

$$\bar{z}_0 = r_{02} z_2 = .49479\, z_2$$

These estimated values are recorded in the column $r_{02}z_2$. The
residuals $(z_0 - r_{02} z_2)$, or parts of $z_0$ which are independent of $z_2$,
are recorded in the column $z_{0.2}$. We can estimate $z_1$ by means
of $z_2$ by the equation

$$\bar{z}_1 = r_{12} z_2 = -.25521\, z_2$$

These estimated values are recorded in column $r_{12}z_2$. The
residuals $(z_1 - r_{12} z_2)$, or parts of $z_1$ independent of $z_2$, are recorded
in the column $z_{1.2}$. That part of $z_1$ which is independent of $z_2$,
namely $z_{1.2}$, may be used to estimate $z_{0.2}$. Straightforward
calculation of the regression equation gives

$$\bar{z}_{0.2} = \frac{\Sigma z_{0.2} z_{1.2}}{\Sigma z_{1.2}^2}\, z_{1.2} = \frac{9.14030}{11.21842}\, z_{1.2} = .81476\, z_{1.2}$$

The constant .81476 $(= \beta_{01.2})$ is here seen to be a regression
coefficient, being just as real and definite in its meaning as
those found in any other two-variable problem. Finally taking
$(z_{0.2} - \beta_{01.2} z_{1.2})$ we obtain $z_{0.12}$, the final residuals that are left
after having utilized both $z_1$ and $z_2$ to the utmost in estimating
$z_0$. These magnitudes are our final errors of estimate. Calculating
their standard deviation in the usual manner we
obtain

$$k_{0.12} = .36686$$

The residuals $z_{0.12}$ could have been obtained more directly
without the calculation of $z_{0.2}$ and $z_{1.2}$ by the regression equation
involving the two variables. We have

$$z_{0.12} = z_0 - \beta_{01.2} z_1 - \beta_{02.1} z_2$$


in which

$$\beta_{01.2} = \frac{r_{01} - r_{02} r_{12}}{1 - r_{12}^2} = .81476$$

$$\beta_{02.1} = \frac{r_{02} - r_{01} r_{12}}{1 - r_{12}^2} = .70272$$
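
The residual procedure of this section can be retraced in a few lines of Python. The sketch below starts from the three series of Table LX, removes from $z_0$ and $z_1$ what $z_2$ can account for, and regresses one set of residuals upon the other. The variable names are ours; only the data come from the table.

```python
import math

# The three series of Table LX (standard measures, N = 12).
z0 = [1.75, 1.25, 1.00, 0.75, 0.25, 0.25, -0.25, -0.50, -0.75, -1.00, -1.25, -1.50]
z1 = [1.00, 0.25, 0.00, 1.50, -0.75, 1.25, 0.75, -1.00, 0.00, -1.00, 0.00, -2.00]
z2 = [0.25, 1.00, 1.00, 0.00, 2.00, -0.50, -1.25, 0.00, -1.00, 0.00, -1.75, 0.25]


def sum_products(a, b):
    """Sum of the products of corresponding terms of two series."""
    return sum(x * y for x, y in zip(a, b))


n = len(z0)
r02 = sum_products(z0, z2) / n
r12 = sum_products(z1, z2) / n

# Residuals after removing what z2 can account for (columns z0.2 and z1.2).
z0_2 = [a - r02 * c for a, c in zip(z0, z2)]
z1_2 = [b - r12 * c for b, c in zip(z1, z2)]

# The regression of z0.2 upon z1.2 is the partial regression coefficient.
beta01_2 = sum_products(z0_2, z1_2) / sum_products(z1_2, z1_2)

# Final residuals (column z0.12) and their standard deviation, k0.12.
z0_12 = [u - beta01_2 * v for u, v in zip(z0_2, z1_2)]
k0_12 = math.sqrt(sum_products(z0_12, z0_12) / n)

# The residuals z0.2 should show zero regression upon z2.
leftover = sum_products(z0_2, z2)
print(f"beta01.2 = {beta01_2:.5f}, k0.12 = {k0_12:.5f}, leftover = {leftover:.2e}")
```

The printed constants should agree with those under Table LX to the number of decimals carried there.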


The more lengthy procedure has been followed for the purpose
of showing the exact significance of the $\beta$ constants and of the
residuals, and not because it is the most practical method for
purposes of estimation. If we add the measures in the two
columns $r_{02}z_2$ and $\beta_{01.2}z_{1.2}$, or if we use equation

$$\bar{z}_0 = \beta_{01.2} z_1 + \beta_{02.1} z_2$$

we obtain the best estimates of $z_0$ which it is possible to secure
from $z_1$ and $z_2$, assuming a rectilinear relationship. Such estimates
are here recorded in column $\bar{z}_0$. The correlation between
$z_0$ and $\bar{z}_0$ is the multiple correlation coefficient and will be
designated by the symbol $r_{0.12}$. As multiplying every term in
a series by a constant, or adding a constant amount to every
term, does not change the correlation with a second variable,
the correlation between $z_0$ and $\bar{z}_0$ is identical with that between
$x_0$ and $\bar{x}_0$ or between $X_0$ and $\bar{X}_0$. The multiple correlation
coefficient is the maximum correlation obtainable between
the dependent variable and a weighted composite of the independent
variables. We may therefore read $r_{0.12}$ as "the correlation
between the variable 0 and the best weighted linear
combination of variables 1 and 2." Straightforward calculation
of the correlation between columns $z_0$ and $\bar{z}_0$ yields $r_{0.12}
= .93028$, but a much shorter method of calculation is available.
We have in a two-variable problem

$$\sigma_{1.2} = \sigma_1 \sqrt{1 - r_{12}^2}$$

Since $z_0$ and $\bar{z}_0$ are simply two variables and since the standard
deviation of $z_0 = 1.0$, and the standard deviation of the residuals
in $z_0$ after estimation by aid of 1 and 2 is $k_{0.12}$, we have

$$k_{0.12} = 1.0 \sqrt{1 - r_{0.12}^2}$$

from which

$$r_{0.12} = \sqrt{1 - k_{0.12}^2} \qquad [250]$$
(Value of the multiple correlation coefficient, 3 variables)
The relation between $k_{0.12}$ and $r_{0.12}$ is the same as that between
$k_{12}$ and $r_{12}$ of Formula [86 a], Section 48, hence $k_{0.12}$ is a coefficient
of alienation in the case of three variables. We now need
a simple procedure for the calculation of $k_{0.12}$. Since $k_{0.12}$ is
the standard deviation of the residuals we have

$$k_{0.12} = \sqrt{\frac{1}{N}\,\Sigma (z_0 - \beta_{01.2} z_1 - \beta_{02.1} z_2)^2}$$

Squaring, summing, and collecting terms we will find that the
factor $(1 - r_{12}^2)$ enters into numerator and denominator.
Wherever this factor occurs we will write $k_{12}^2$. Remembering
that

$$\Sigma z_0^2 = \Sigma z_1^2 = \Sigma z_2^2 = N$$

and that

$$\Sigma z_0 z_1 = N r_{01}, \qquad \Sigma z_0 z_2 = N r_{02}, \qquad \Sigma z_1 z_2 = N r_{12}$$

we have

$$k_{0.12}^2 = 1 + \beta_{01.2}^2 + \beta_{02.1}^2 - 2\beta_{01.2} r_{01} - 2\beta_{02.1} r_{02} + 2\beta_{01.2}\beta_{02.1} r_{12}$$

$$= \frac{1}{k_{12}^2}\left(1 - r_{01}^2 - r_{02}^2 - r_{12}^2 + 2 r_{01} r_{02} r_{12}\right) \qquad [251]$$
(Coefficient of alienation, 3 variables)
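
The agreement between the "straightforward" and the shorter calculation of $r_{0.12}$ may be verified numerically. The sketch below, in which the helper `correlation` and the other names are our own, correlates $z_0$ with its estimates directly and then obtains the same coefficient from formulas [251] and [250].

```python
import math

# The three series of Table LX (standard measures, N = 12).
z0 = [1.75, 1.25, 1.00, 0.75, 0.25, 0.25, -0.25, -0.50, -0.75, -1.00, -1.25, -1.50]
z1 = [1.00, 0.25, 0.00, 1.50, -0.75, 1.25, 0.75, -1.00, 0.00, -1.00, 0.00, -2.00]
z2 = [0.25, 1.00, 1.00, 0.00, 2.00, -0.50, -1.25, 0.00, -1.00, 0.00, -1.75, 0.25]


def correlation(a, b):
    """Product-moment correlation, with standard deviations taken over N."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / n
    sd_a = math.sqrt(sum((x - mean_a) ** 2 for x in a) / n)
    sd_b = math.sqrt(sum((y - mean_b) ** 2 for y in b) / n)
    return cov / (sd_a * sd_b)


r01, r02, r12 = correlation(z0, z1), correlation(z0, z2), correlation(z1, z2)
beta01_2 = (r01 - r02 * r12) / (1 - r12 ** 2)
beta02_1 = (r02 - r01 * r12) / (1 - r12 ** 2)

# Direct method: correlate z0 with the estimates of z0.
estimates = [beta01_2 * u + beta02_1 * v for u, v in zip(z1, z2)]
r_direct = correlation(z0, estimates)

# Shorter method: formula [251] for k^2, then formula [250].
k_squared = (1 - r01 ** 2 - r02 ** 2 - r12 ** 2 + 2 * r01 * r02 * r12) / (1 - r12 ** 2)
r_short = math.sqrt(1 - k_squared)

print(f"direct = {r_direct:.5f}, from k = {r_short:.5f}")
```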


The general solution of the coefficient of alienation in the
case of $n$ variables is well accomplished by the aid of determinants,
and we may here note this form of solution for the case
of three variables. If we write the major determinant

$$\Delta = \begin{vmatrix} 1 & r_{01} & r_{02} \\ r_{01} & 1 & r_{12} \\ r_{02} & r_{12} & 1 \end{vmatrix} \qquad [252]$$

and call the minor obtained by deleting the first row and the
first column $\Delta_{00}$, we have

$$\Delta_{00} = \begin{vmatrix} 1 & r_{12} \\ r_{12} & 1 \end{vmatrix} \qquad [253]$$

Evaluating these determinants we obtain the numerator and
denominator respectively of the fraction giving $k_{0.12}^2$ so that
we may write

$$k_{0.12}^2 = \frac{\Delta}{\Delta_{00}} \qquad [254]$$
(Multiple coefficient of alienation as the quotient of determinants)

This is here proven for the case of three variables, but we will
later find that the equation holds generally for any number of
variables. If we are concerned only with the value of the
multiple correlation coefficient, and not with the constants of
the regression equation, the simplest way to find it is to first
determine $k_{0.12}$ and then $r_{0.12}$. If we have the regression coefficients
we may obtain $k_{0.12}$, and thus $r_{0.12}$, from them. We have called
$k_{0.12}$ the multiple alienation coefficient. It is the measure of
independence of variable $x_0$ from variables $x_1$ and $x_2$. We will
define $k_{01.2}$ as the partial alienation coefficient. It is the
measure of independence of $x_0$ and $x_1$ for a constant value of $x_2$.
Thus, by definition, if $r_{01.2}$ is the partial correlation between
$x_0$ and $x_1$ for a constant value of $x_2$, we have

$$k_{01.2}^2 + r_{01.2}^2 = 1.0 \qquad [255]$$
(Relation between partial coefficients of correlation and of alienation)

This is the equation for three variables comparable to formula
[86 a], $k_{12}^2 + r_{12}^2 = 1.0$, found for two variables. We thus find
that whether $k$ has one primary subscript (a subscript occurring
before the point is termed a primary and one after the point a
secondary subscript), $k_{0.12}$, or two primary subscripts, $k_{01.2}$, the
type equation, $k^2 + r^2 = 1.0$, holds. Thus far we have found
the total, multiple, and partial relationships as follows, respectively.

$$k_{01}^2 + r_{01}^2 = 1$$
$$k_{0.12}^2 + r_{0.12}^2 = 1$$
$$k_{01.2}^2 + r_{01.2}^2 = 1$$

The same relation will be found to hold when $n$ variables are
involved, so that universally, provided the subscripts are the
same,

$$k^2 + r^2 = 1 \qquad [256]$$
(General relation between $k$ and $r$)

We do not have a $k$ with three primary subscripts, but $k_{01}$
and $k_{0.1}$ may be shown to be identical. Dealing with $z$'s we
have found $k_{01} = \sqrt{1 - r_{01}^2}$ and $k_{0.1}$ = the standard deviation
of the arrays of $z_0$'s, i.e. $k_{0.1} = \sigma_{z_0}\sqrt{1 - r_{01}^2} = \sqrt{1 - r_{01}^2}$,
since, when dealing with $z$'s the standard deviation $\sigma_{z_0}$ is equal
to 1. Accordingly

$$k_{0.1} = k_{01} \qquad [257]$$

Equations [251] and [254] have expressed $k_{0.12}$ in terms of
the total correlation coefficients. We may also evaluate this
multiple alienation coefficient in terms of other total and partial
coefficients, but will first need to determine a partial coefficient
of correlation. Having shown that $\beta_{01.2}$ is the regression of
$z_{0.2}$ upon $z_{1.2}$, and since by parity $\beta_{10.2}$ is the regression of $z_{1.2}$
upon $z_{0.2}$, we immediately have, since every condition leading to
$r_{12} = \sqrt{b_{12} b_{21}}$, formula [90], is exactly paralleled when dealing
with $z_{0.2}$'s and $z_{1.2}$'s,

$$r_{01.2} = \sqrt{\beta_{01.2}\,\beta_{10.2}} \qquad [258]$$
(Partial coefficient of correlation in terms of partial regression coefficients, 3 variables)

The partial coefficient $r_{01.2}$ is identical with $r_{10.2}$ but custom
places first the numerically smaller of the subscripts before the
point.

$$k_{01.2}^2 = 1 - \beta_{01.2}\,\beta_{10.2} = \frac{1 - r_{01}^2 - r_{02}^2 - r_{12}^2 + 2 r_{01} r_{02} r_{12}}{k_{12}^2\, k_{02}^2}$$

but, by [251],

$$k_{0.12}^2 = \frac{1 - r_{01}^2 - r_{02}^2 - r_{12}^2 + 2 r_{01} r_{02} r_{12}}{k_{12}^2}$$

that is

$$k_{0.12}^2 = k_{02}^2\, k_{01.2}^2 \quad\text{or}\quad k_{0.12} = k_{02}\, k_{01.2} \qquad [259]$$
(Multiple coefficient of alienation in terms of total and partial coefficients of alienation, 3 variables)
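
Formulas [254] and [259] give two routes to the same $k_{0.12}^2$, and it is easy to confirm that they agree. The following Python sketch uses the correlations of Section 81; the function `det3` and the variable names are ours and form no part of the text's notation.

```python
import math


def det3(m):
    """Determinant of a 3 x 3 array, expanded along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


# Total correlations of the Section 81 illustration.
r01, r02, r12 = 7.625 / 12, 5.9375 / 12, -3.0625 / 12

# Route 1, formula [254]: quotient of the major determinant by its first minor.
major = det3([[1.0, r01, r02], [r01, 1.0, r12], [r02, r12, 1.0]])
minor_00 = 1.0 - r12 ** 2
k2_by_determinants = major / minor_00

# Route 2, formulas [258], [255] and [259].
beta01_2 = (r01 - r02 * r12) / (1.0 - r12 ** 2)   # regression of z0.2 upon z1.2
beta10_2 = (r01 - r02 * r12) / (1.0 - r02 ** 2)   # regression of z1.2 upon z0.2
r2_partial = beta01_2 * beta10_2                  # square of r01.2
k2_partial = 1.0 - r2_partial                     # square of k01.2
k2_by_product = (1.0 - r02 ** 2) * k2_partial     # k02^2 * k01.2^2

print(f"k0.12 by determinants = {math.sqrt(k2_by_determinants):.5f}")
print(f"k0.12 by product      = {math.sqrt(k2_by_product):.5f}")
print(f"r01.2                 = {math.sqrt(r2_partial):.5f}")
```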


We may now outline the most expeditious manner of calculating 
all of the constants ordinarily desired in the solution of a 
multiple correlation problem. These constants recorded in the 
proper order of calculation are: 


the means, $M_0$, $M_1$, and $M_2$

the standard deviations, $\sigma_0$, $\sigma_1$, and $\sigma_2$

the total correlations, $r_{01}$, $r_{02}$ and $r_{12}$

the squares of the total alienation coefficients, $k_{01}^2$, $k_{02}^2$ and $k_{12}^2$

the $\beta$ regression coefficients
$$\beta_{01.2} = \frac{r_{01} - r_{02} r_{12}}{k_{12}^2}, \qquad \beta_{02.1} = \frac{r_{02} - r_{01} r_{12}}{k_{12}^2}, \qquad \beta_{10.2} = \frac{r_{01} - r_{02} r_{12}}{k_{02}^2}$$

the square of the partial correlation coefficient
$$r_{01.2}^2 = \beta_{01.2}\,\beta_{10.2}$$

the square of the partial alienation coefficient
$$k_{01.2}^2 = 1 - r_{01.2}^2$$

the square of the multiple alienation coefficient
$$k_{0.12}^2 = k_{02}^2\, k_{01.2}^2$$

the multiple alienation coefficient, $k_{0.12}$

the multiple correlation coefficient, $r_{0.12} = \sqrt{1 - k_{0.12}^2}$

the $b$ regression coefficients
$$b_{01.2} = \beta_{01.2}\frac{\sigma_0}{\sigma_1}, \qquad b_{02.1} = \beta_{02.1}\frac{\sigma_0}{\sigma_2}$$

the constant $c$
$$c = M_0 - b_{01.2} M_1 - b_{02.1} M_2$$

giving the regression equation
$$\bar{X}_0 = b_{01.2} X_1 + b_{02.1} X_2 + c$$

the standard error of estimate, or the standard deviation
of the $X_0$-arrays from the regression line
$$\sigma_{0.12} = \sigma_0\, k_{0.12}$$
Excepting the probable errors of the constants (see formulas 
[278], [279] and [280]) the solution is complete. 
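
The order of calculation just outlined translates directly into a short routine. The sketch below follows the list step by step; the function name, the dictionary keys, and the means and standard deviations are hypothetical, while the three correlations are those of Section 81.

```python
import math


def three_variable_constants(means, sds, r01, r02, r12):
    """Compute the constants of a 3-variable problem in the order listed."""
    m0, m1, m2 = means
    s0, s1, s2 = sds
    # Squares of the total alienation coefficients.
    k2_02 = 1.0 - r02 ** 2
    k2_12 = 1.0 - r12 ** 2
    # Beta regression coefficients.
    beta01_2 = (r01 - r02 * r12) / k2_12
    beta02_1 = (r02 - r01 * r12) / k2_12
    beta10_2 = (r01 - r02 * r12) / k2_02
    # Partial correlation and alienation, then the multiple coefficients.
    r2_01_2 = beta01_2 * beta10_2
    k2_01_2 = 1.0 - r2_01_2
    k2_0_12 = k2_02 * k2_01_2
    k_0_12 = math.sqrt(k2_0_12)
    r_0_12 = math.sqrt(1.0 - k2_0_12)
    # Gross-score regression equation and standard error of estimate.
    b01_2 = beta01_2 * s0 / s1
    b02_1 = beta02_1 * s0 / s2
    c = m0 - b01_2 * m1 - b02_1 * m2
    return {
        "beta01.2": beta01_2, "beta02.1": beta02_1,
        "r01.2": math.sqrt(r2_01_2), "k0.12": k_0_12, "r0.12": r_0_12,
        "b01.2": b01_2, "b02.1": b02_1, "c": c,
        "sigma0.12": s0 * k_0_12,
    }


# Correlations from Section 81; means and sigmas are invented for illustration.
constants = three_variable_constants(
    means=(60.0, 50.0, 40.0), sds=(12.0, 10.0, 8.0),
    r01=7.625 / 12, r02=5.9375 / 12, r12=-3.0625 / 12,
)
for name, value in constants.items():
    print(f"{name:>10} = {value:.4f}")
```

Because $\beta_{10.2}$ is needed only for the partial correlation, a worker interested solely in the regression equation could omit that step and take $k_{0.12}^2$ from formula [251] instead.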


Section 82. THE USE OF THE ALIGNMENT CHART


The calculation of the $\beta$ constants may be easily accomplished
by the aid of an alignment chart. The following directions
apply to the small chart in the appendix, described in
detail and with explanatory problems in (Kelley, 1921, chart),
and also to a large chart devised upon the same principle
(Kelley, 1921, align). Items (j) and (k) and the four-variable
problem illustration should be read after the treatment of
the $n$-variable problem, Section 83, of this text. The accuracy
of the chart in the appendix is very slightly less than that
of a 10-inch slide rule, while the large chart gives results
of the same degree of accuracy as a 20-inch slide rule.

The scales for $r_{13}$ and $r_{23}$ are graduated according to the
logarithms of numbers from 10 to 100, and the product scale
is so graduated as to indicate the products of any two numbers
on scales $r_{13}$ and $r_{23}$ when connected by a straight line. Accordingly
all products and quotients, including squares and
square roots, may be obtained. In all these operations the
simplest way to keep track of the decimal point is to carry
the operation through roughly in one's head and then place the
point where it belongs. A strip of transparent celluloid with a
straight line scratched upon it, or a silk thread drawn taut,
makes a serviceable straight edge.

Scale $1/k$ is graduated according to the logarithms of
$1/\sqrt{1 - r^2}$ and scale $1/k^2$ according to the logarithms of
$1/(1 - r^2)$. Scale $1/K^2$ is a continuation of scale $1/k^2$. When
values on scale $1/K^2$ are used, place a straight edge through
this value and parallel to the base line [as explained in example
(c)] and locate a point on scale $1/k^2$. Then continue the calculation
using the point so located on scale $1/k^2$ in lieu of the
point on scale $1/K^2$.

The following magnitudes are needed in multiple correlation
work:

(a) Products, such as $r_{13} r_{23}$

(b) Quotients, such as $\dfrac{\sigma_1}{\sigma_2}$

(c) Square roots, such as $\sqrt{\beta_{12.3}\,\beta_{21.3}}$

(d) Factors $\dfrac{1}{k_{13}} \left(= \dfrac{1}{\sqrt{1 - r_{13}^2}}\right)$ which enter into partial
coefficients of correlation

(e) Coefficients of alienation, such as $k_{13} \left(= \sqrt{1 - r_{13}^2}\right)$

(f) Factors $\dfrac{1}{k_{23}^2} \left(= \dfrac{1}{1 - r_{23}^2}\right)$ which enter into regression
coefficients

(g) Squares of coefficients of alienation, such as $k_{23}^2 \left(= 1 - r_{23}^2\right)$

(h) Partial regression coefficients, such as
$$\beta_{12.3} \left(= \frac{r_{12} - r_{13} r_{23}}{k_{23}^2}\right) \qquad [242]$$

(i) Partial correlation coefficients, such as
$$r_{12.3} \left(= \frac{r_{12} - r_{13} r_{23}}{k_{13}\, k_{23}} = \sqrt{\beta_{12.3}\,\beta_{21.3}}\right)$$

(j) Partial regression coefficients involving four variables
$$\beta_{12.34} \left(= \frac{\beta_{12.4} - \beta_{13.4}\beta_{32.4}}{1 - \beta_{23.4}\beta_{32.4}} = \frac{\beta_{12.3} - \beta_{14.3}\beta_{42.3}}{1 - \beta_{24.3}\beta_{42.3}}\right)$$
Since $k_{23.4}^2 = 1 - \beta_{23.4}\beta_{32.4}$, and since the calculation
which leads to $\beta_{23.4}$ is changed in but one simple
respect to obtain $\beta_{32.4}$, it is convenient to write:
$$\beta_{12.34} = \frac{\beta_{12.4} - \beta_{13.4}\beta_{32.4}}{1 - \beta_{23.4}\beta_{32.4}} \qquad [264 a]$$

(k) Partial regression coefficients involving more than four
variables
$$\beta_{12.34\cdots n} = \frac{\beta_{12.4\cdots n} - \beta_{13.4\cdots n}\,\beta_{32.4\cdots n}}{1 - \beta_{23.4\cdots n}\,\beta_{32.4\cdots n}} \qquad [264 b]$$
The same procedure as in (j) is followed, but in this
case the calculation which leads to $\beta_{23.4\cdots n}$ does
not, by one simple change, lead to $\beta_{32.4\cdots n}$.
Examples:

(a) $.2 \times .4$ Place a straight edge on 20, scale $r_{13}$, and
upon 40, scale $r_{23}$, and read the product, .08, on the
product scale.

(b) $\dfrac{2.0}{.40}$ Place a straight edge upon 20, product scale, and
upon 40, scale $r_{23}$, and read the quotient, 5.0, on
scale $r_{13}$.

(c) $\sqrt{.25}$ Place a straight edge on 25, product scale, and
parallel to the base line of the chart (this can be
done by rotating the straight edge until the readings
on scales $r_{13}$ and $r_{23}$ are identical) and read the square
root, .50, on either scale $r_{13}$ or $r_{23}$.

(d) $\dfrac{1}{\sqrt{1 - .60^2}}$ Find 60 on scale $1/k$ and read the answer,
1.25, from the same point on scale $r_{13}$.

(e) $\sqrt{1 - .60^2}$ Place a straight edge through 60, scale
$1/k$, and 100, product scale, and read the answer,
.80, on scale $r_{23}$.

(f) $\dfrac{1}{1 - .60^2}$ Find 60 on scale $1/k^2$ and read the answer,
1.5625, from the same point on scale $r_{23}$.

(g) $1 - .60^2$ Place a straight edge through 60, scale $1/k^2$,
and 100, product scale, and read the answer, .64,
on scale $r_{13}$.

(h) $\dfrac{.78 - .60 \times .80}{1 - .80^2}$ Find the product of .60 and .80 by
(a). On a separate scratch paper subtract this from
.78, obtaining .30. Place a straight edge between
30, scale $r_{13}$, and 80, scale $1/k^2$, and read the answer,
.833, on the product scale.

(i) $\dfrac{.78 - .60 \times .80}{\sqrt{1 - .60^2}\,\sqrt{1 - .80^2}}$ Find $\dfrac{.78 - .60 \times .80}{1 - .80^2}$ by (h).
Find $\dfrac{.78 - .60 \times .80}{1 - .60^2}$ by (h). Multiply and extract
the square root by (a) and (c), yielding the answer
.625.


(j) Given: $\beta_{12.4} = .70$; $\beta_{13.4} = .60$; $\beta_{32.4} = .80$; $\beta_{23.4} = .5469$.
Required:
$$\beta_{12.34} = \frac{.70 - .60 \times .80}{1 - .80 \times .5469}$$
Find the numerator as in (h) and the denominator in the same
manner. Then divide as in (b). This gives
$$\frac{.2200}{.5625} = .3911$$

If, as is frequently the case, $\beta_{32.4}$ and $\beta_{23.4}$ are nearly
equal, $k_{23.4}^2$ is closely given by:
$$1 - \left(\frac{\beta_{23.4} + \beta_{32.4}}{2}\right)^2$$

In this case the procedure may be as follows:
$$\frac{.70 - .60 \times .80}{1 - .80 \times .76}$$

Find the numerator, .2200, as before. On scratch
paper determine .78, the arithmetic average of .80
and .76. Place a straight edge between .78, scale
$1/k^2$, and .22, scale $r_{13}$, and read the answer, .5618,
on the product scale. This answer is in error by
.0006, which is of the same order of magnitude as the
error attendant upon the use of the large chart.
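
The size of the error introduced by the averaging shortcut in example (j) can be checked by direct arithmetic. The sketch below is not a model of the chart itself; it simply evaluates formula [264 a] exactly and with the product $\beta_{23.4}\beta_{32.4}$ replaced by the square of the arithmetic mean. The function names are ours.

```python
def beta_12_34(b12_4, b13_4, b32_4, b23_4):
    """Formula [264 a], evaluated exactly."""
    return (b12_4 - b13_4 * b32_4) / (1.0 - b23_4 * b32_4)


def beta_12_34_shortcut(b12_4, b13_4, b32_4, b23_4):
    """Same quotient with b23.4 * b32.4 replaced by the squared mean."""
    mean = (b23_4 + b32_4) / 2.0
    return (b12_4 - b13_4 * b32_4) / (1.0 - mean ** 2)


# First part of example (j): the two conjugate betas differ considerably.
first = beta_12_34(0.70, 0.60, 0.80, 0.5469)

# Second part: the conjugate betas are .80 and .76, so the shortcut applies.
exact = beta_12_34(0.70, 0.60, 0.80, 0.76)
approx = beta_12_34_shortcut(0.70, 0.60, 0.80, 0.76)
error = approx - exact

print(f"first = {first:.4f}, exact = {exact:.4f}, approx = {approx:.4f}, error = {error:.4f}")
```

Since the arithmetic mean of two unequal positive numbers exceeds their geometric mean, the shortcut always makes the denominator a little too small and the answer a little too large; the nearer the two conjugate coefficients, the smaller the excess.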

As a sample problem in three variables the following
data are given:

TABLE LXI
Table of Correlations, Means and Standard Deviations

                 VARIABLES
            1        2        3
  2       .225
  3       .274     .404
  Means  68.15    43.60    52.20
  σ's    10.50    12.24     9.63

Solving
$\beta_{12.3} = .1366$
$\beta_{21.3} = .1236$
$\beta_{13.2} = .2200$
$k_{1.23}^2 = .9093$
$r_{1.23} = .3011$
$\sigma_{1.23} = 10.01$

$\bar{z}_1 = .1366\, z_2 + .2200\, z_3$
$\bar{X}_1 = .1172\, X_2 + .2399\, X_3 + 50.52$

As a sample problem in four variables the following
data are given:

TABLE LXII

                 VARIABLES
            1        2        3        4
  2       .225
  3       .274     .404
  4       .134     .060     .231
  Means  68.15    43.60    52.20    45.40
  σ's    10.50    12.24     9.63    14.25

Solving
$\beta_{12.34} = .1398$
$\beta_{21.34} = .1270$
$\beta_{13.24} = .1991$
$\beta_{14.23} = .0796$
$k_{1.234}^2 = .9033$
$r_{1.234} = .3109$
$\sigma_{1.234} = 9.980$

$\bar{z}_1 = .1398\, z_2 + .1991\, z_3 + .0796\, z_4$
$\bar{X}_1 = .1199\, X_2 + .2171\, X_3 + .0587\, X_4 + 48.92$
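
The four-variable constants of Table LXII can be retraced by building each second-order $\beta$ from first-order $\beta$'s, as in item (j). The sketch below is one way of arranging that arithmetic; the helper names are ours, and the step giving $k^2$ uses the standard identity $k^2 = 1 - \Sigma \beta r$, which follows from the normal equations but is not the route taken in the text.

```python
import math

# Total correlations of Table LXII, keyed by (smaller, larger) variable number.
r = {(1, 2): 0.225, (1, 3): 0.274, (2, 3): 0.404,
     (1, 4): 0.134, (2, 4): 0.060, (3, 4): 0.231}
means = {1: 68.15, 2: 43.60, 3: 52.20, 4: 45.40}
sds = {1: 10.50, 2: 12.24, 3: 9.63, 4: 14.25}


def corr(a, b):
    """Total correlation between variables a and b."""
    return r[(min(a, b), max(a, b))]


def beta_first(a, b, c):
    """beta_{ab.c} = (r_ab - r_ac * r_bc) / (1 - r_bc^2)."""
    return (corr(a, b) - corr(a, c) * corr(b, c)) / (1.0 - corr(b, c) ** 2)


def beta_second(a, b, c, d):
    """beta_{ab.cd} by formula [264 a], from first-order betas holding d fixed."""
    numerator = beta_first(a, b, d) - beta_first(a, c, d) * beta_first(c, b, d)
    denominator = 1.0 - beta_first(b, c, d) * beta_first(c, b, d)
    return numerator / denominator


beta12_34 = beta_second(1, 2, 3, 4)
beta13_24 = beta_second(1, 3, 2, 4)
beta14_23 = beta_second(1, 4, 2, 3)

# Assumed identity: k^2 = 1 - sum of (beta * total correlation with criterion).
k_squared = 1.0 - (beta12_34 * corr(1, 2) + beta13_24 * corr(1, 3)
                   + beta14_23 * corr(1, 4))
sigma_1_234 = sds[1] * math.sqrt(k_squared)

# Gross-score weights and constant, as in formulas [247] and [248].
b = {j: beta * sds[1] / sds[j]
     for j, beta in ((2, beta12_34), (3, beta13_24), (4, beta14_23))}
c = means[1] - sum(b[j] * means[j] for j in b)

print(f"betas: {beta12_34:.4f} {beta13_24:.4f} {beta14_23:.4f}")
print(f"k^2 = {k_squared:.4f}, sigma = {sigma_1_234:.3f}, c = {c:.2f}")
```

The results should agree with the table to about the accuracy of the chart from which the printed figures were presumably read.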


Section 83. THE GENERAL TREATMENT OF THE $n$-VARIABLE
PROBLEM


We will now attack the general problem. The reader will
need an elementary knowledge of determinants to follow the
discussion. We are given a criterion variable, $X_0$, and the
independent variables, $X_1, X_2, \cdots, X_n$ (the population will be
designated by $N$, which symbol must not be confused with $n$,
the number of independent variables). Expressing every
variable in terms of standard measures by the transformations

$$z = \frac{X - M}{\sigma}$$

it is required to determine the $\beta$ constants in the following
equation in the best fit manner.

$$\bar{z}_0 = \beta_{01.23\cdots n}\, z_1 + \beta_{02.13\cdots n}\, z_2 + \cdots + \beta_{0n.12\cdots n-1}\, z_n \qquad [260]$$

$(z_0 - \bar{z}_0)$ is an error of estimate and will be designated by $z_{0.12\cdots n}$.
The $\beta$'s are to be so determined that the standard deviation of these
errors of estimate, $k_{0.12\cdots n}$, shall be a minimum.

$$k_{0.12\cdots n} = \sqrt{\frac{1}{N}\,\Sigma z_{0.12\cdots n}^2} = \sqrt{\frac{1}{N}\,\Sigma \left(z_0 - \beta_{01.23\cdots n}\, z_1 - \beta_{02.13\cdots n}\, z_2 - \cdots - \beta_{0n.12\cdots n-1}\, z_n\right)^2}$$

Differentiating with respect to the first $\beta$ and setting the derivative
equal to zero, gives

$$\frac{2}{N}\,\Sigma \left[z_0 - \beta_{01.23\cdots n}\, z_1 - \beta_{02.13\cdots n}\, z_2 - \cdots - \beta_{0n.12\cdots n-1}\, z_n\right](-z_1) = 0$$

Summing, expressing square sums in terms of standard deviations
and product sums in terms of correlations, yields,

$$r_{01} - \beta_{01.23\cdots n} - r_{12}\,\beta_{02.13\cdots n} - \cdots - r_{1n}\,\beta_{0n.12\cdots n-1} = 0$$

Differentiating successively with respect to the other $\beta$'s gives

$$r_{02} - r_{12}\,\beta_{01.23\cdots n} - \beta_{02.13\cdots n} - \cdots - r_{2n}\,\beta_{0n.12\cdots n-1} = 0$$
etc. to
$$r_{0n} - r_{1n}\,\beta_{01.23\cdots n} - r_{2n}\,\beta_{02.13\cdots n} - \cdots - \beta_{0n.12\cdots n-1} = 0 \qquad [261]$$
(Normal equations)


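This gives $n$ linear equations from which to determine the same
number of $\beta$ constants. The determinantal solution is readily
written. Let the major determinant be $\Delta$.

$$\Delta = \begin{vmatrix} 1 & r_{01} & r_{02} & \cdots & r_{0n} \\ r_{01} & 1 & r_{12} & \cdots & r_{1n} \\ r_{02} & r_{12} & 1 & \cdots & r_{2n} \\ \cdots & \cdots & \cdots & \cdots & \cdots \\ r_{0n} & r_{1n} & r_{2n} & \cdots & 1 \end{vmatrix} \qquad [262]$$

and let $\Delta_{pq}$ be the minor obtained by crossing out the $p$'th
row and $q$'th column of the major determinant. The $p$'th
row is that row having $p$ as one of the subscripts of the $r$'s
throughout and the $q$'th column is that column having $q$ as one
of the subscripts throughout. Then

$$\beta_{0p.12\cdots()\cdots n} = \frac{-(-1)^p\,\Delta_{0p}}{\Delta_{00}} \qquad [263]$$
(The regression coefficient as the quotient of two determinants)

The quantity $-(-1)^p$ is merely a sign factor. The column
crossed out is the 0'th for all the $\beta$'s so that $q = 0$. To illustrate
in detail we have

$$\beta_{01.23\cdots n} = \frac{\Delta_{01}}{\Delta_{00}} = \frac{\begin{vmatrix} r_{01} & r_{12} & r_{13} & \cdots & r_{1n} \\ r_{02} & 1 & r_{23} & \cdots & r_{2n} \\ r_{03} & r_{23} & 1 & \cdots & r_{3n} \\ \cdots & \cdots & \cdots & \cdots & \cdots \\ r_{0n} & r_{2n} & r_{3n} & \cdots & 1 \end{vmatrix}}{\begin{vmatrix} 1 & r_{12} & r_{13} & \cdots & r_{1n} \\ r_{12} & 1 & r_{23} & \cdots & r_{2n} \\ r_{13} & r_{23} & 1 & \cdots & r_{3n} \\ \cdots & \cdots & \cdots & \cdots & \cdots \\ r_{1n} & r_{2n} & r_{3n} & \cdots & 1 \end{vmatrix}}$$

$$\beta_{02.13\cdots n} = \frac{-\Delta_{02}}{\Delta_{00}} = \frac{-\begin{vmatrix} r_{01} & 1 & r_{13} & \cdots & r_{1n} \\ r_{02} & r_{12} & r_{23} & \cdots & r_{2n} \\ r_{03} & r_{13} & 1 & \cdots & r_{3n} \\ \cdots & \cdots & \cdots & \cdots & \cdots \\ r_{0n} & r_{1n} & r_{3n} & \cdots & 1 \end{vmatrix}}{\Delta_{00}} \qquad [264]$$

$$\beta_{03.124\cdots n} = \frac{\Delta_{03}}{\Delta_{00}}$$

$$\beta_{04.1235\cdots n} = \frac{-\Delta_{04}}{\Delta_{00}}$$

etc. to

$$\beta_{0n.12\cdots n-1} = \frac{-(-1)^n\,\Delta_{0n}}{\Delta_{00}}$$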

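Algebraic manipulation (see Kelley, 1921, chart) enables the
expressing of a partial regression coefficient in terms of partial
regression coefficients of one lower order, thus,

$$\beta_{12.34} = \frac{\beta_{12.4} - \beta_{13.4}\,\beta_{32.4}}{1 - \beta_{23.4}\,\beta_{32.4}} = \frac{\beta_{12.3} - \beta_{14.3}\,\beta_{42.3}}{1 - \beta_{24.3}\,\beta_{42.3}} \qquad [264 a]$$

and in general

$$\beta_{12.34\cdots n} = \frac{\beta_{12.4\cdots n} - \beta_{13.4\cdots n}\,\beta_{32.4\cdots n}}{1 - \beta_{23.4\cdots n}\,\beta_{32.4\cdots n}} \qquad [264 b]$$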


Note that if the variables are designated by subscripts 1, 2,
3, $\cdots$ instead of, as here, by 0, 1, 2, $\cdots$ the sign factor is given
by $-(-1)^{p+q}$ in which $q$ always equals 1. Probably the
simplest way to keep track of the sign is to note that the
denominator determinant is always positive and that the
numerator determinants alternate in sign beginning with plus
for the first $\beta$. Let us define

$$\beta_{pq.12\cdots()()\cdots n} \quad\text{and}\quad \beta_{qp.12\cdots()()\cdots n} \qquad [265]$$
(Conjugate $\beta$'s)

as conjugate regression coefficients. Then

$$\beta_{pq.12\cdots()()\cdots n} = \frac{-(-1)^{p+q}\,\Delta_{pq}}{\Delta_{pp}}$$

and

$$\beta_{qp.12\cdots()()\cdots n} = \frac{-(-1)^{q+p}\,\Delta_{qp}}{\Delta_{qq}}$$

Since the major determinant is symmetrical, $\Delta_{pq} = \Delta_{qp}$ and
the signs of the two are alike; thus the partial correlation
coefficient is given by the square root of the product.

$$r_{pq.12\cdots()()\cdots n} = \frac{-(-1)^{p+q}\,\Delta_{pq}}{\sqrt{\Delta_{pp}}\,\sqrt{\Delta_{qq}}} \qquad [266]$$
(Determinantal expression for the partial coefficient of correlation)
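
Formula [266] may be tried upon the three-variable illustration of Section 81, where the partial correlations are also available in closed form. In the sketch below the determinant routine and all names are our own; only the correlations come from the text.

```python
import math


def determinant(m):
    """Expansion along the first row; suitable for small arrays only."""
    if not m:
        return 1.0
    total = 0.0
    for j, value in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * value * determinant(sub)
    return total


def minor(m, p, q):
    """Determinant left after crossing out row p and column q."""
    return determinant([row[:q] + row[q + 1:]
                        for i, row in enumerate(m) if i != p])


def partial_correlation(corr, p, q):
    """Formula [266]: correlation of p and q, all other variables constant."""
    sign = -((-1) ** (p + q))
    return sign * minor(corr, p, q) / math.sqrt(minor(corr, p, p) * minor(corr, q, q))


# Total correlations of the Section 81 illustration.
r01, r02, r12 = 7.625 / 12, 5.9375 / 12, -3.0625 / 12
corr = [[1.0, r01, r02], [r01, 1.0, r12], [r02, r12, 1.0]]

r01_2 = partial_correlation(corr, 0, 1)
r02_1 = partial_correlation(corr, 0, 2)

# Closed forms for three variables, for comparison.
r01_2_closed = (r01 - r02 * r12) / math.sqrt((1 - r02 ** 2) * (1 - r12 ** 2))
r02_1_closed = (r02 - r01 * r12) / math.sqrt((1 - r01 ** 2) * (1 - r12 ** 2))

# Formula [259]: k0.12 = k02 * k01.2.
k0_12 = math.sqrt(1 - r02 ** 2) * math.sqrt(1 - r01_2 ** 2)
print(f"r01.2 = {r01_2:.5f}, r02.1 = {r02_1:.5f}, k0.12 = {k0_12:.5f}")
```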
The partial correlations that are of most interest and value are
generally those involving the criterion and required in the
calculation of the multiple alienation coefficient.

$$r_{01.23\cdots n} = \frac{\Delta_{01}}{\sqrt{\Delta_{00}}\,\sqrt{\Delta_{11}}} \qquad [267]$$
(A partial correlation coefficient of the $(n-1)$th order)


This may be written (Kelley, 1921, chart) 


701-23---” 


701-23---n = AB aes on ee Pe Rn CT ry eras - [267 a] 


The order is determined by the number of secondary subscripts;
thus $r_{01\cdot 2345}$ is a partial coefficient of the 4th order, $r_{01\cdot 2}$ of the
first order and $r_{01}$ of the zero order.

$$r_{02\cdot 34\ldots n} = \frac{\Delta_{01,12}}{\sqrt{\Delta_{01,01}}\,\sqrt{\Delta_{12,12}}}$$
(Determinantal expression for a partial correlation
coefficient of the $(n-2)$th order) . . . [268]

The magnitude $\Delta_{01,12}$ indicates the minor obtained by crossing
out the 0 and 1 rows and the 1 and 2 columns. Note that the
sign factor is positive. This is clearly the case, since we are
now really dealing with a major determinant of an order one
lower in which row and column 2 have taken the place of row
and column 1, row and column 3 the place of row and column
2, etc.


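Continuing

$$r_{03\cdot 45\ldots n} = \frac{\Delta_{012,123}}{\sqrt{\Delta_{012,012}}\,\sqrt{\Delta_{123,123}}}$$

etc. to

$$r_{0n} = \frac{\Delta_{012\ldots(n-1),\,123\ldots n}}{\sqrt{\Delta_{012\ldots(n-1),\,012\ldots(n-1)}}\,\sqrt{\Delta_{123\ldots n,\,123\ldots n}}} = \frac{r_{0n}}{1\times 1}$$
(Partial coefficient of zero order, or a total correlation coefficient) . . [269]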


The various minors needed in the solution of this series of
partial coefficients of correlation may be obtained incidentally
in the process of obtaining the first minor if the determinant is
evaluated in a certain manner which, however, may not always
be the most convenient way for other needs. Having the
various partial correlation coefficients we may determine the
partial alienation coefficients by the equation $k = \sqrt{1 - r^2}$.
These will prove serviceable in obtaining the multiple correlation
coefficient, but we shall first need to establish the value
of an alienation coefficient of a certain order in terms of an
order one less. In dealing with $z_0$ and $z_1$, between which the
correlation is $r_{01}$, we have found, formula [257],

$$k^2_{0\cdot 1} = \sigma^2_0\,(1 - r^2_{01}) = 1\,(1 - r^2_{01}) = k^2_{01}$$

If we deal with magnitudes $z_{0\cdot 2}$, residuals in $z_0$ after estimation
by $z_2$, and $z_{1\cdot 2}$, residuals in $z_1$ after estimation by $z_2$, between
which the correlation is $r_{01\cdot 2}$, we have, following the identical
reasoning that led to the preceding equation,

$$k^2_{0\cdot 12} = k^2_{0\cdot 2}\,(1 - r^2_{01\cdot 2}) = k^2_{0\cdot 2}\,k^2_{01\cdot 2} = k^2_{02}\,k^2_{01\cdot 2}$$
Obviously the principle can be applied to residuals of any
order so that, in general,

$$k^2_{0\cdot 12\ldots n} = k^2_{0\cdot 23\ldots n}\,k^2_{01\cdot 23\ldots n}$$
$$k^2_{0\cdot 12\ldots n} = k^2_{0\cdot 13\ldots n}\,k^2_{02\cdot 13\ldots n}$$
etc. to
$$k^2_{0\cdot 12\ldots n} = k^2_{0\cdot 12\ldots n-1}\,k^2_{0n\cdot 12\ldots n-1}$$
(The $n$ ways of expressing a multiple alienation coefficient of the
$n$-th order, in terms of multiple alienation coefficients of the
$(n-1)$th order and of partial alienation coefficients of the $(n-1)$th
order) . . . . [270]


Expressing $k^2_{0\cdot 23\ldots n}$ as equal to $k^2_{0\cdot 34\ldots n}\,k^2_{02\cdot 34\ldots n}$ and continuing
the process for every $k$, until finally $k^2_{0\cdot n} = k^2_{0n}$, we have,
taking the square root,

$$k_{0\cdot 12\ldots n} = k_{01\cdot 23\ldots n}\;k_{02\cdot 34\ldots n}\;k_{03\cdot 45\ldots n} \times \cdots \times k_{0n}$$
(One of the many ways of expressing a multiple alienation
coefficient of the $n$-th order in terms of partial alienation
coefficients of lower order) . . . . [271]
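For two independent variables, formula [271] reads $k_{0\cdot 12} = k_{01\cdot 2}\,k_{02}$. The sketch below checks this numerically, assuming illustrative correlations that are not from the text; the multiple correlation used for comparison comes from the ordinary two-predictor normal equations, and the function names are hypothetical.

```python
from math import sqrt

def alienation(r):
    """Alienation coefficient k = sqrt(1 - r^2)."""
    return sqrt(1.0 - r * r)

def first_order_partial(r_ab, r_ac, r_bc):
    """Correlation of a and b with c held constant."""
    return (r_ab - r_ac * r_bc) / (alienation(r_ac) * alienation(r_bc))

# Illustrative correlations (assumed values, not from the text).
r01, r02, r12 = 0.6, 0.5, 0.4

# Right-hand side of [271] for n = 2: product of partial alienation coefficients.
k_product = alienation(first_order_partial(r01, r02, r12)) * alienation(r02)

# Direct route: squared multiple correlation from the two-predictor normal equations.
multiple_r_squared = (r01 ** 2 + r02 ** 2 - 2 * r01 * r02 * r12) / (1 - r12 ** 2)
k_direct = sqrt(1 - multiple_r_squared)

print(k_product, k_direct)
```

Eliminating the variables in the other order, $k_{02\cdot 1}\,k_{01}$, gives the same product, which is the sense in which [270] offers $n$ equivalent expressions.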


Having the multiple alienation coefficient we obtain

$$r_{0\cdot 12\ldots n} = \sqrt{1 - k^2_{0\cdot 12\ldots n}}$$
(The multiple correlation coefficient) . . [272]

and also

$$\sigma_{0\cdot 12\ldots n} = \sigma_0\,k_{0\cdot 12\ldots n}$$
(Standard error of estimate) . . . . [273]


This completes the solution, but it is sometimes easier to obtain
$r_{0\cdot 12\ldots n}$ by the direct evaluation of the major determinant $\Delta$
and the minor $\Delta_{00}$. That we can obtain the multiple correlation
coefficient in this manner will now be shown. If $z_0$ is the
criterion and $\bar{z}_0$ the estimate of it, the correlation between them
is the multiple correlation coefficient, and, if we let $\sigma_{\bar{z}}$ represent
the standard deviation of the $\bar{z}_0$ measures, it is given by

$$r_{0\cdot 12\ldots n} = \frac{\sum z_0\,\bar{z}_0}{N\,\sigma_0\,\sigma_{\bar{z}}}$$

The standard deviation of the $\bar{z}_0$ measures is the standard
deviation of the points upon the regression line passing as
closely as possible to the $z_0$ measures. Thus, just as in the
case of two variables where $\sigma^2_1 = \sigma^2_{1\cdot 2} + \sigma^2_{\bar{1}}$ [formula 87], in
which $\sigma_{\bar{1}}$ is the standard deviation of the means of the arrays,
so here with $(n + 1)$ variables

$$\sigma^2_0 = \sigma^2_{0\cdot 12\ldots n} + \sigma^2_{\bar{z}}$$

Dealing with $z$ measures, $\sigma_0 = 1$ and $\sigma_{0\cdot 12\ldots n} = k_{0\cdot 12\ldots n}$, so
that

$$\sigma^2_{\bar{z}} = 1 - k^2_{0\cdot 12\ldots n}$$

As we have already found that this is equal to $r^2_{0\cdot 12\ldots n}$ [formula
272] we have

$$\sigma_{\bar{z}} = r_{0\cdot 12\ldots n}$$
(Standard deviation of estimated standard scores
is equal to the multiple correlation coefficient) . [274]

since $r_{0\cdot 12\ldots n}$ is of necessity positive. Total and partial
correlation coefficients may be positive or negative; multiple
correlation coefficients can only be positive. Thus continuing
we have:


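$$r^2_{0\cdot 12\ldots n} = \frac{1}{N}\sum z_0\,(\beta_{01\cdot 23\ldots n}\,z_1 + \beta_{02\cdot 13\ldots n}\,z_2 + \cdots + \beta_{0n\cdot 12\ldots n-1}\,z_n)$$
$$= r_{01}\,\beta_{01\cdot 23\ldots n} + r_{02}\,\beta_{02\cdot 13\ldots n} + \cdots + r_{0n}\,\beta_{0n\cdot 12\ldots n-1}$$
$$= \frac{1}{\Delta_{00}}\,[\,r_{01}\Delta_{01} - r_{02}\Delta_{02} + r_{03}\Delta_{03} - \cdots - (-1)^n\,r_{0n}\Delta_{0n}\,]$$

Referring to the major determinant, we see that, expanding it
in terms of the elements of the first column, it is given by

$$\Delta = \Delta_{00} - r_{01}\Delta_{01} + r_{02}\Delta_{02} - \cdots + (-1)^n\,r_{0n}\Delta_{0n}$$

thus

$$r^2_{0\cdot 12\ldots n} = \frac{1}{\Delta_{00}}\,(\Delta_{00} - \Delta) = 1 - \frac{\Delta}{\Delta_{00}}$$

or

$$r_{0\cdot 12\ldots n} = \sqrt{1 - \frac{\Delta}{\Delta_{00}}}$$
(Determinantal solution of the multiple
correlation coefficient) . . . . [275]

and further

$$k_{0\cdot 12\ldots n} = \sqrt{\frac{\Delta}{\Delta_{00}}}$$
(Determinantal solution of the multiple
alienation coefficient) . . . . [276]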
As a corollary to the two derivations [formulas 271 and 276]
we have

$$\sqrt{\frac{\Delta}{\Delta_{00}}} = k_{01\cdot 23\ldots n}\;k_{02\cdot 34\ldots n} \times \cdots \times k_{0n}$$ . . . . [277]
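Formulas [275] and [276] are easy to try on the six-variable example treated in Section 84. The sketch below assumes the correlations of Table LXIII as transcribed here (variable 0 is the criterion) and uses a hypothetical helper `det`; first-row expansion is chosen only because it is short, and it would be slow for large determinants.

```python
from math import sqrt

def det(m):
    """Determinant by expansion along the first row (adequate for small matrices)."""
    if not m:
        return 1.0
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

# Lower triangle of the correlation table: row i holds r_i0, r_i1, ..., r_i(i-1).
lower = [
    [0.4017],
    [0.6003, 0.2332],
    [0.2379, 0.1986, 0.1747],
    [0.6807, 0.2569, 0.4520, 0.2628],
    [0.3553, 0.1064, 0.2139, 0.0033, 0.2989],
]
n = 6
R = [[1.0] * n for _ in range(n)]
for i, row in enumerate(lower, start=1):
    for j, r in enumerate(row):
        R[i][j] = R[j][i] = r

major = det(R)                                # the major determinant
minor_00 = det([row[1:] for row in R[1:]])    # row 0 and column 0 crossed out
k = sqrt(major / minor_00)                    # formula [276]
multiple_r = sqrt(1 - major / minor_00)       # formula [275]
print(round(multiple_r, 5))
```

The printed value can be compared with the determinantal solution .79009053 quoted later in the section.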


The preferable method for calculating $k_{0\cdot 12\ldots n}$ depends upon
the order and whether the partial alienation and correlation
coefficients are needed in the solution of the particular problem.

The theoretical solution of the $n$-variable problem is now
complete except for the probable errors of the constants involved.
The standard errors of certain constants may be
immediately written down by analogy with the usual two
variable situations, simply noting, e.g., that $x_{0\cdot 2}$ replaces $x_0$
and $x_{1\cdot 2}$ replaces $x_1$, etc. Thus we have by parity with formula
[108 b]

$$\sigma_{r_{01\cdot 2}} = \frac{k^2_{01\cdot 2}}{\sqrt{N}}$$
(Standard error of a partial coefficient of
correlation, 3 variables) . . . . [278]

$$\sigma_{r_{0\cdot 12}} = \frac{k^2_{0\cdot 12}}{\sqrt{N}}$$
(Standard error of a multiple coefficient
of correlation, 3 variables) . . . . [279]

By parity with formula [107]

$$\sigma_{b_{01\cdot 2}} = \frac{\sigma_{0\cdot 2}\,k_{01\cdot 2}}{\sigma_{1\cdot 2}\,\sqrt{N}} = \frac{\sigma_{0\cdot 12}}{\sigma_{1\cdot 2}\,\sqrt{N}}$$
(Standard error of a $b$ regression
coefficient, 3 variables) . . . . [280]


Plainly we may, in the case of $n$ independent variables, deal
with residuals of higher order just as we have with residuals
of first and zero order and obtain:

$$\sigma_{r_{01\cdot 23\ldots n}} = \frac{k^2_{01\cdot 23\ldots n}}{\sqrt{N}}$$
(Standard error of a partial coefficient
of correlation) . . . . [281]

$$\sigma_{r_{0\cdot 123\ldots n}} = \frac{k^2_{0\cdot 123\ldots n}}{\sqrt{N}}$$
(Standard error of a multiple coefficient
of correlation) . . . . [282]

$$\sigma_{b_{01\cdot 23\ldots n}} = \frac{\sigma_{0\cdot 123\ldots n}}{\sigma_{1\cdot 23\ldots n}\,\sqrt{N}}$$
(Standard error of a regression
coefficient) . . . . [283]
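As a small worked case of [282] and [283], the sketch below assumes a multiple correlation of .79009 and a population of 300; both figures are borrowed from the example of Section 84, where the population is stated only as "about 300". The residual standard deviations passed to the second function are purely illustrative, and both function names are hypothetical.

```python
from math import sqrt

def se_correlation(k_squared, n_cases):
    """Formulas [281] and [282]: squared alienation coefficient over sqrt(N)."""
    return k_squared / sqrt(n_cases)

def se_regression(sigma_0_resid, sigma_1_resid, n_cases):
    """Formula [283]: sigma_{0.123...n} over (sigma_{1.23...n} times sqrt(N))."""
    return sigma_0_resid / (sigma_1_resid * sqrt(n_cases))

multiple_r = 0.79009   # from the example of Section 84
n_cases = 300          # "about 300" in the text, taken here as exact

se_multiple_r = se_correlation(1 - multiple_r ** 2, n_cases)
print(round(se_multiple_r, 4))

# Illustrative residual standard deviations (assumed values, not from the text).
se_b = se_regression(0.613, 0.95, n_cases)
```

The probable error, where wanted, follows on multiplying the standard error by .6745.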


Section 84. THE METHOD OF SUCCESSIVE APPROXIMATIONS


With more than five variables either of the preceding methods 
is laborious, and to meet this situation I have developed and 
herewith present a method of successive approximations to 
the values of the regression coefficients and to the multiple 
correlation coefficient. I have not as yet developed other than 
empirical tests of convergency. The method may be best
presented in connection with a numerical illustration. 

If given all the regression coefficients except the first, we
may write

$$\bar{z}_0 = w_1 z_1 + \beta_{02\cdot 13\ldots n}\,z_2 + \beta_{03\cdot 12\ldots n}\,z_3 + \cdots + \beta_{0n\cdot 12\ldots n-1}\,z_n$$ . . [284]

in which $w_1$ is unknown, but all the $\beta$'s are known. We may
now determine $w_1$. Designating the right-hand member, i.e.,
the total right-hand composite inclusive of $w_1 z_1$, by $c$ and the
right-hand composite exclusive of $w_1 z_1$ by $(c - 1)$ [to be read,
"the composite exclusive of variable 1"] we have

$$\bar{z}_0 = w_1 z_1 + (c - 1)$$ . . . . [285]

The problem is now a simple three variable problem, the
variables being $z_0$, $z_1$ and $(c - 1)$, the correlations between which
we will designate as $r_{01}$, $r_{0(c-1)}$ and $r_{1(c-1)}$. Two of these
correlations have to be determined. Both $r_{0(c-1)}$ and $r_{1(c-1)}$ are
correlations between one variable and a weighted sum and are
given by formula [140]. Thus we immediately have the regression
coefficient of $z_0$ upon $z_1$:

$$w_1 = \frac{r_{01} - r_{0(c-1)}\,r_{1(c-1)}}{k^2_{1(c-1)}}$$ . . . . [286]

and the regression coefficient of $z_0$ upon $(c - 1)$ equals

$$\frac{r_{0(c-1)} - r_{01}\,r_{1(c-1)}}{k^2_{1(c-1)}}$$
The weight $w_1$ as thus determined must be identical with
$\beta_{01\cdot 23\ldots n}$ and the regression coefficient of $(c - 1)$ as thus
determined must equal 1.0, else a better fit than the regression
equation fit has been obtained, which we know is impossible.
We therefore see that if we know all of the regression coefficients
except one, we can determine that one without resorting
to the evaluation of two lengthy determinants.

The thought occurred to me that with reasonable weightings,
guesses, or weightings somehow derived from a priori considerations,
for a large number of variables no one of which
was of greater importance than all the rest combined, it was to
be expected that the closeness of estimate of the weighted sum
of all the variables but one, which I shall call $(c - 1)$, would
vary less than the weight guessed for the one. Thus if the
guessed weights are $w_1, w_2, w_3, \ldots, w_n$, and if $c$ is the weighted
sum $(w_1 z_1 + w_2 z_2 + w_3 z_3 + \cdots + w_n z_n)$, the calculation of the
regression coefficient of $z_0$ upon $z_1$, i.e., the calculation of
$\beta_{01\cdot(c-1)}$, would result in a closer approach to $\beta_{01\cdot 23\ldots n}$ than,
in all likelihood, was $w_1$. We will call this regression coefficient
$w_{11}$ and take it as a second approximation to $\beta_{01\cdot 23\ldots n}$. A
similar procedure using $w_1, w_3, w_4, \ldots, w_n$ (not $w_{11}, w_3, w_4, \ldots, w_n$)
will result in a second approximation $w_{22}$ to the correct weight
for $z_2$, etc., for each of the other variables. We then have
weights $w_{11}, w_{22}, w_{33}, \ldots, w_{nn}$ and may repeat the process, obtaining
third approximation values $w_{111}, w_{222}, w_{333}, \ldots, w_{nnn}$ and
still other approximations should they be needed. Just as soon
as the repetition of the process results in new weights which
are identical with those used in obtaining them we have the
proof that the regression coefficients have been found, since as
pointed out (following formula 286) this is the unique property
of the regression coefficients. Therefore if repeating the
process a fourth time should give $w_{1111} = w_{111}$, $w_{2222} = w_{222}$, etc.,
we know that $w_{111} = \beta_{01\cdot 23\ldots n}$, $w_{222} = \beta_{02\cdot 13\ldots n}$, etc., and
the problem is solved. We will not expect identical agreement,
but such agreement as is needed for practical purposes, say
within .1 per cent, .01 per cent, or whatever other limit is
self-imposed. Presumably the larger the number of variables
the more rapidly convergent are the successive approximations,
but I am not able to supply the theoretical proof that the convergence
must take place under all circumstances. A second
check upon the general approximation to regression equation
weightings may be found in the size of the multiple correlation
obtained. For convergence to be present this must increase
for every step.

The following example which has only six variables, and 
therefore constitutes a more severe test than would a problem 
having a larger number of variables, is given. The variables 
are: 0, the criterion, being a measure of general scholastic
success of school children in two successive elementary school 
grades (population about 300); the remaining variables are 
the scores made by the children in the five tests comprising 
one of the forms of the National Intelligence tests. 

(1) A test in arithmetical reasoning 
(2) A test in sentence completion 
(3) A test in logical selections of reasons for conduct. 
(4) A test in naming synonyms and antonyms. 
(5) A test in substituting digits for symbols. 
The correlations between scores are

TABLE LXIII

                        Variables
                 0       1       2       3       4
             1  .4017
             2  .6003   .2332
Variables    3  .2379   .1986   .1747
             4  .6807   .2569   .4520   .2628
             5  .3553   .1064   .2139   .0033   .2989


The symbol $c$ will stand for the composite score according to
whatever weightings are used upon the five tests: the symbols
$(c - 1)$, $(c - 2)$, etc., stand for the composite scores upon all
five tests, except test one, except test two, etc. The problem
is to make $r_{0c}$ a maximum. Treating one of the five variables
as unique and obtaining a composite score on the other four
gives us a three variable problem, the variables being 0, $u$,
$(c - u)$, in which $u$ stands for the unique variable, being in
turn 1, 2, 3, 4, 5, and the regression equation being

$$\bar{z}_0 = \beta_{0u\cdot(c-u)}\,z_u + \beta_{0(c-u)\cdot u}\,(c - u)$$ . . . . [287]


The value of the second regression coefficient will ordinarily
be in the neighborhood of 1.00, but it does not enter into our
present treatment. The first regression coefficient is the new
weight $w_{uu}$, determined for $z_u$, and is given by

$$w_{uu} = \frac{r_{0u} - r_{0(c-u)}\,r_{u(c-u)}}{k^2_{u(c-u)}}$$ . . . . [288]

Let $s$ stand for the sum of the products of the correlations of
the independent variables with the criterion into the weights
of the independent variables, i.e.,

$$s = w_1 r_{01} + w_2 r_{02} + w_3 r_{03} + w_4 r_{04} + w_5 r_{05}$$ . . . . [289]

Let $S$ stand for twice the sum of all product terms of the sort
$w_u w_{u'} r_{uu'}$, i.e., $S$ in our present problem is a summation of
2 X 10 terms as follows:

$$S = 2\,(w_1 w_2 r_{12} + w_1 w_3 r_{13} + w_1 w_4 r_{14} + w_1 w_5 r_{15} + w_2 w_3 r_{23} + w_2 w_4 r_{24} + w_2 w_5 r_{25} + w_3 w_4 r_{34} + w_3 w_5 r_{35} + w_4 w_5 r_{45})$$ . . . . [290]

Let $2S_u$ stand for the sum of those terms in $S$ (2 X 4 in number
in our present problem) which involve $w_u$. Thus $S$ is equal to
the sum of the $S_u$, or in the present problem,

$$S = S_1 + S_2 + S_3 + S_4 + S_5$$

and finally let $\Sigma w^2$ stand for the sum of the squares of the
weights. That is,

$$\Sigma w^2 = w_1^2 + w_2^2 + w_3^2 + w_4^2 + w_5^2$$ . . . . [291]

We readily obtain by formulas [163] and [149]

$$\sigma_c = \sqrt{\Sigma w^2 + S}$$
(Standard deviation of the $c$ composite score) . . . . [292]

$$r_{0c} = \frac{s}{\sigma_c}$$
(Correlation of criterion with the $c$ composite score) . . . . [293]

$$\sigma_{c-u} = \sqrt{\Sigma w^2 + S - w_u^2 - 2S_u}$$
(Standard deviation of the $c-u$ composite score) . . [294]

$$r_{0(c-u)} = \frac{s - w_u r_{0u}}{\sigma_{c-u}}$$
(Correlation of the criterion with the $c-u$ composite score) . . . . [295]

$$r_{u(c-u)} = \frac{S_u}{w_u\,\sigma_{c-u}}$$
(Correlation of the test treated uniquely
with the $c-u$ composite score) . . . . [296]

It will be noted that if we have a problem involving one
dependent variable and $n$ independent variables there are
$n$ terms in $s$, $n(n - 1)$ terms in $S$, and $(n - 1)$ terms in $S_u$. We
now have all the requisite formulas and may proceed with the
calculation. For our first series of weights we will take $w_1 = 2$,
$w_2 = 4$, $w_3 = 1$, $w_4 = 5$ and $w_5 = 2$, which are roughly proportional
to the total correlation coefficients of the tests with
the criterion. In the accompanying table $u$ stands for the
variable designated in the stub.


TABLE LXIV

[Table LXIV, printed sideways in the original: calculation of the second approximation weights from the first weights 2, 4, 1, 5, 2. The body of the table is not recoverable here.]

We will later find that no one of these weights is in error by as much as .01, but that is anticipating. Using these new weights we repeat the process as shown in the following table.


TABLE LXV

[Table LXV, printed sideways in the original: calculation of the third approximation weights from the second approximation weights. The body of the table is not recoverable here; the resulting weights are .1940, .3240, .0294, .4358 and .1352.]


Tabulating the results thus far obtained, we have

TABLE LXVI

                               WEIGHTS
Tests                  First    Second    Third
1                        2       .19      .1940
2                        4       .33      .3240
3                        1       .03      .0294
4                        5       .43      .4358
5                        2       .14      .1352

Multiple correlation
resulting              .7877    .79005

The first weights give a multiple correlation of .7877 and lead 
to the determination of the second approximation weights. 
The second weights give a multiple correlation of .79005 and 
lead to the determination of the third approximation weights. 
The third weights differ so slightly from the second that for 
ordinary purposes one would stop the calculation here, use the 
third weights as final and take the multiple correlation as equal 
to .7901 since it will be a trifle above .79005. The method
of calculation of the weights here shown involves but a fraction
of the time necessary to evaluate the determinants necessary
to a solution. This is true for three reasons:

(a) Number of operations is much smaller. 

(b) No checking for inaccuracies in any of the calculations, 
except that for the last weights derived, need be made, as a 
small error leading to a wrong approximate weight will be 
corrected in the next step. 

(c) Partial regression coefficients $\beta_{0u\cdot(c-u)}$, except for the
last step where greater accuracy may be desired, may be obtained
by the aid of the alignment chart.
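The whole procedure can be sketched in a few lines. The following is a minimal illustration, assuming the correlations of Table LXIII as transcribed here; the names `multiple_r` and `next_weights` are hypothetical. Every new weight in a round is computed from the old weights, as the text specifies, and formula [297] is used for $r_{u(c-u)}$ so that a zero weight gives no trouble.

```python
from math import sqrt

# Correlations of Table LXIII: R0[u] is r_0u, RX[u][v] is r_uv, for tests 1..5.
R0 = [0.4017, 0.6003, 0.2379, 0.6807, 0.3553]
RX = [
    [1.0,    0.2332, 0.1986, 0.2569, 0.1064],
    [0.2332, 1.0,    0.1747, 0.4520, 0.2139],
    [0.1986, 0.1747, 1.0,    0.2628, 0.0033],
    [0.2569, 0.4520, 0.2628, 1.0,    0.2989],
    [0.1064, 0.2139, 0.0033, 0.2989, 1.0],
]

def multiple_r(w):
    """Correlation of the criterion with the composite c, formulas [292] and [293]."""
    n = len(w)
    s = sum(w[u] * R0[u] for u in range(n))
    var_c = sum(w[u] * w[v] * RX[u][v] for u in range(n) for v in range(n))
    return s / sqrt(var_c)

def next_weights(w):
    """One round of approximation; each new weight uses only the OLD weights."""
    n = len(w)
    s = sum(w[u] * R0[u] for u in range(n))
    var_c = sum(w[u] * w[v] * RX[u][v] for u in range(n) for v in range(n))
    new = []
    for u in range(n):
        a = sum(w[v] * RX[u][v] for v in range(n) if v != u)     # S_u / w_u
        sd_cu = sqrt(var_c - w[u] ** 2 - 2 * w[u] * a)           # [294]
        r0_cu = (s - w[u] * R0[u]) / sd_cu                       # [295]
        ru_cu = a / sd_cu                                        # [297]
        new.append((R0[u] - r0_cu * ru_cu) / (1 - ru_cu ** 2))   # [288]
    return new

w = [2.0, 4.0, 1.0, 5.0, 2.0]            # first weights used in the text
history = [(w, multiple_r(w))]
for _ in range(6):
    w = next_weights(w)
    history.append((w, multiple_r(w)))

for weights, r in history:
    print([round(x, 4) for x in weights], round(r, 5))
```

Unrounded weights are carried from round to round here, so the intermediate figures differ slightly from those of Table LXVI, where the second weights were rounded to two places before use.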

A further device which is serviceable is to compare $r_{0c}$ with
each of the $r_{0(c-u)}$ values in the same calculation. Should
any one of the $r_{0(c-u)}$ correlations be larger than $r_{0c}$, it indicates
that the weight used for the test in question is worse than
would be a weight of zero. Referring to the first of the calculations
above, we find that $r_{0c} = .7877$ and that $r_{0(c-3)}$
= .7882. This means that the weight which was assumed for
test 3, namely 1.00, is a worse weight than would be the weight
zero. Thus if the problem is such that only positive weights
have been used as the first approximations, any variables
which should have negative weights will probably be discovered
in the first calculation by the correlation $r_{0(c-u)}$ turning out
higher than $r_{0c}$.
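This check is quickly made by machine. The sketch below assumes the correlations of Table LXIII as transcribed here and the first weights 2, 4, 1, 5, 2; the variable names are hypothetical. It lists every test whose removal from the composite raises the correlation with the criterion.

```python
from math import sqrt

# Correlations of Table LXIII: R0[u] is r_0u, RX[u][v] is r_uv, for tests 1..5.
R0 = [0.4017, 0.6003, 0.2379, 0.6807, 0.3553]
RX = [
    [1.0,    0.2332, 0.1986, 0.2569, 0.1064],
    [0.2332, 1.0,    0.1747, 0.4520, 0.2139],
    [0.1986, 0.1747, 1.0,    0.2628, 0.0033],
    [0.2569, 0.4520, 0.2628, 1.0,    0.2989],
    [0.1064, 0.2139, 0.0033, 0.2989, 1.0],
]
w = [2, 4, 1, 5, 2]
n = len(w)

s = sum(w[u] * R0[u] for u in range(n))                                  # [289]
var_c = sum(w[u] * w[v] * RX[u][v] for u in range(n) for v in range(n))  # [292] squared
r0c = s / sqrt(var_c)                                                    # [293]

r0_without = []
for u in range(n):
    a = sum(w[v] * RX[u][v] for v in range(n) if v != u)   # S_u / w_u
    var_cu = var_c - w[u] ** 2 - 2 * w[u] * a              # [294] squared
    r0_without.append((s - w[u] * R0[u]) / sqrt(var_cu))   # [295]

# Tests (numbered from 1) whose assumed weight is worse than a weight of zero.
suspect = [u + 1 for u in range(n) if r0_without[u] > r0c]
print(round(r0c, 4), [round(r, 4) for r in r0_without], suspect)
```

Only test 3 is flagged, in agreement with the text; the value found for $r_{0(c-3)}$ may differ from the printed .7882 in the last place.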

The solution by determinants of the above problem correct 
to seven decimal places has been kindly supplied to me by 
Miss Ella Woodyard. 


$w_1 = .19412341$          $w_4 = .43693997$
$w_2 = .32392693$          $w_5 = .13466545$
$w_3 = .02748474$          $r_{0\cdot 12345} = .79009053$


It will be seen that the maximum error in the third approximation
weights is .0019, which is the error for $w_3$. This would
probably be considered a negligible error. Should, however,
greater accuracy be required, a determination of fourth order
approximation weights will give it. Actually such calculation
gives weights, no one of which is in error by more than .0001.
I have also made a fifth calculation resulting in the multiple
correlation $r_{0\cdot 12345} = .79009038$, which is seen to be in error by
.00000015. Thus for these data there can be no doubt that
rapid convergence actually exists. One desiring to practice
the method is referred to Yerkes (1921), where abundant
multiple correlation equation material already worked out by
the determinantal method is to be found. I have used this
method upon a variety of problems and have always found
convergence. Much time will be saved if the original guesses
as to the final weights are excellent, but the method does not
require approximate accuracy in the original weights. To illustrate
this, let us work the present problem, starting with
weights 0, 1, 2, -2, -1, which are about as unreasonable
as it is possible to assume. The calculation gives


TABLE LXVII

                                        WEIGHTS
Tests                  First guess    Second approximation    Third approximation
1                           0             [illegible]               .188
2                           1             [illegible]               .362
3                           2             [illegible]               .031
4                          -2             [illegible]               .437
5                          -1             [illegible]               .151

Multiple correlation
resulting                 -.23               .784


Evidence of convergence is not clearly apparent from these 
three series of weights, but it of course is apparent by com- 
parison of the third weights with the correct values. The very 
poor choice of original weights has increased the number of 
calculations necessary to establish convergence, but it has had 
no other effect. 

A possible difficulty in the calculation of the $\beta_{0u\cdot(c-u)}$ coefficients
in case one of the approximate weights is zero may be
mentioned. In case $w_u = 0$,

$$r_{u(c-u)} = \frac{S_u}{w_u\,\sigma_{c-u}} = \frac{0}{0}$$ . . . . [296 a]

To avoid this indeterminate form we may write

$$r_{u(c-u)} = \frac{\Sigma\,w_{u'}\,r_{uu'}}{\sigma_{c-u}}$$ . . . . [297]

in which the summation extends over every $u'$ other than $u$,
instead of the preceding, which is generally shorter to use.
As an illustration of this situation it may be noted that $w_1$ was
chosen equal to 0 in Table LXVII. Thus $S_1 = 0$ and $r_{1(c-1)}
= \frac{0}{0}$ by formula [296]. Using formula [297] we have

$$r_{1(c-1)} = \frac{w_2 r_{12} + w_3 r_{13} + w_4 r_{14} + w_5 r_{15}}{\sigma_{c-1}} = .0037$$
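The arithmetic behind the figure .0037 can be retraced as follows. This sketch assumes the intercorrelations of Table LXIII as transcribed here and the first-guess weights 0, 1, 2, -2, -1 of Table LXVII; the variable names are hypothetical.

```python
from math import sqrt

# Intercorrelations of the five tests (Table LXIII), tests indexed 0..4 for 1..5.
RX = [
    [1.0,    0.2332, 0.1986, 0.2569, 0.1064],
    [0.2332, 1.0,    0.1747, 0.4520, 0.2139],
    [0.1986, 0.1747, 1.0,    0.2628, 0.0033],
    [0.2569, 0.4520, 0.2628, 1.0,    0.2989],
    [0.1064, 0.2139, 0.0033, 0.2989, 1.0],
]
w = [0, 1, 2, -2, -1]        # first guess of Table LXVII
u = 0                        # test 1, whose weight is zero
others = [v for v in range(len(w)) if v != u]

# Variance of the composite (c - 1): weighted sum of tests 2..5.
var_cu = sum(w[a] * w[b] * RX[a][b] for a in others for b in others)
sd_cu = sqrt(var_cu)

# Formula [297]: no division by w_u, hence no 0/0.
numerator = sum(w[v] * RX[u][v] for v in others)
r_u_cu = numerator / sd_cu
print(round(numerator, 4), round(sd_cu, 4), round(r_u_cu, 4))
```

Formula [296] would call for dividing $S_1 = 0$ by $w_1 = 0$, which is why the second form is needed here.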

This is no longer indeterminate. Except in this calculation
of $r_{u(c-u)}$ no special procedure will be necessary on account of
a zero weight. The introduction of zero weights where reasonable
leads to a simplification of the numerical work. For
the problem in hand, if the first estimated weights had been
2, 4, 0, 5, 2 instead of 2, 4, 1, 5, 2 it would have simplified the
first calculation and led to rapid convergence. It is well to estimate
a zero weight whenever in doubt. The regression weights
as just determined are of course $\beta$ coefficients, $w_1 = \beta_{01\cdot 23\ldots n}$,
$w_2 = \beta_{02\cdot 13\ldots n}$, etc., pertaining to the equation [260]

$$\bar{z}_0 = \beta_{01\cdot 23\ldots n}\,z_1 + \beta_{02\cdot 13\ldots n}\,z_2 + \cdots + \beta_{0n\cdot 12\ldots n-1}\,z_n$$

Making the substitutions of equations [247] and [248] immediately
gives the regression equation involving gross scores

$$\bar{X}_0 = b_{01\cdot 23\ldots n}\,X_1 + b_{02\cdot 13\ldots n}\,X_2 + \cdots + b_{0n\cdot 12\ldots n-1}\,X_n + c$$

The regression coefficients and the multiple correlation coefficient
are given by this successive approximation method.
The partial alienation and correlation coefficients, as well
as the important standard errors, may all be obtained by formulas
given earlier in this chapter.




CHAPTER XII 


STATISTICAL TREATMENT OF SUNDRY SPECIAL 
PROBLEMS 


Section 85. STATISTICAL CONSTANTS DETERMINED FROM
MUTILATED DISTRIBUTIONS


If a portion only of a distribution is available it is possible 
to reconstruct the entire distribution when a reasonable assump- 
tion of the form of the entire distribution can be made. The 
principle is applicable to any form, but only in case the assumed 
form is normal are the constants enabling a ready calculation 
available in tables. Let us assume that data for the tail of 
a sharply truncated distribution, which is in truth normal, 
are available. The ‘‘tail’’ may be greater or less than one-half 
of the total or untruncated distribution. The distance from 
the stump to the mean of the tail bears a ratio to the standard 
deviation of the tail which changes as the point of truncation 
changes; conversely, the value of this ratio determines the 
proportion of the total distribution which is represented by the 
tail. This is the property utilized by Pearson and Lee (1908), 
and by Lee (1914), in reconstructing the total distribution from 
a sharply truncated portion. Tables facilitating this process 
are to be found in the references cited. 

There are other properties, such as the ratio between the 
median deviation and the mean deviation of the tail measured 
from the point of truncation, which can be utilized to the same 
purpose, and it is not at all evident that the error of such 
determination is greater than that of the Pearson and Lee 
determination. The probable errors which establish the relia- 
bility of either method are at present unavailable. The ac- 
companying Table, LXVIII, gives the ratio of the median 
deviation from the stump, to the mean deviation, for successive 
percentages of a total normal distribution. 

TABLE LXVIII

        MEDIAN              MEDIAN              MEDIAN
  q     ------        q     ------        q     ------
         MEAN                MEAN                MEAN
 .01    .7363        .34    .8143        .67    .8833
 .02    .7425        .35    .8162        .68    .8858
 .03    .7470        .36    .8181        .69    .8884
 .04    .7508        .37    .8199        .70    .8909
 .05    .7541        .38    .8218        .71    .8935
 .06    .7571        .39    .8237        .72    .8962
 .07    .7599        .40    .8256        .73    .8988
 .08    .7625        .41    .8276        .74    .9016
 .09    .7650        .42    .8295        .75    .9043
 .10    .7674        .43    .8314        .76    .9071
 .11    .7697        .44    .8334        .77    .9100
 .12    .7719        .45    .8353        .78    .9129
 .13    .7741        .46    .8373        .79    .9159
 .14    .7762        .47    .8393        .80    .9189
 .15    .7782        .48    .8413        .81    .9220
 .16    .7803        .49    .8433        .82    .9252
 .17    .7823        .50    .8453        .83    .9284
 .18    .7843        .51    .8474        .84    .9317
 .19    .7862        .52    .8495        .85    .9350
 .20    .7881        .53    .8516        .86    .9384
 .21    .7901        .54    .8537        .87    .9420
 .22    .7920        .55    .8558        .88    .9456
 .23    .7938        .56    .8580        .89    .9492
 .24    .7957        .57    .8601        .90    .9530
 .25    .7976        .58    .8623        .91    .9569
 .26    .7995        .59    .8646        .92    .9610
 .27    .8013        .60    .8668        .93    .9651
 .28    .8032        .61    .8691        .94    .9694
 .29    .8050        .62    .8714        .95    .9738
 .30    .8069        .63    .8737        .96    .9785
 .31    .8087        .64    .8761        .97    .9833
 .32    .8106        .65    .8785        .98    .9884
 .33    .8125        .66    .8809        .99    .9939

Entering Table LXVIII with the ratio given by the data 
leads to q, the proportion in the tail, and thus to N, the popula- 
tion of the total untruncated distribution. The further steps 
in the solution will be obvious from the problem discussed in 
the next paragraph. 
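The entries of Table LXVIII, and the reverse look-up from an observed ratio to $q$, can be sketched as below. The code assumes a unit normal distribution with the tail taken as everything lying below the stump; the function names are hypothetical, and the bisection relies on the ratio rising steadily with $q$, as the table indicates.

```python
from statistics import NormalDist

STD = NormalDist()   # unit normal distribution

def median_mean_ratio(q):
    """Median deviation over mean deviation of a normal tail holding the
    proportion q, both deviations being measured from the stump."""
    x = STD.inv_cdf(q)                     # position of the stump
    mean_dev = x + STD.pdf(x) / q          # stump minus mean of the tail
    median_dev = x - STD.inv_cdf(q / 2)    # stump minus median of the tail
    return median_dev / mean_dev

def proportion_from_ratio(ratio, lo=0.001, hi=0.999):
    """Find q by bisection, assuming the ratio increases with q."""
    for _ in range(60):
        mid = (lo + hi) / 2
        if median_mean_ratio(mid) < ratio:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(round(median_mean_ratio(0.50), 4))   # compare with the table entry for .50
q = proportion_from_ratio(0.9181)          # ratio found in the Virginia example
print(round(q, 4), round(411 / q))         # proportion in tail, total population
```

With the ratio .9181 of the illustration that follows, the look-up gives a proportion close to the .7975 read from the table, and hence a total population of about 515.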


It not unfrequently happens that the total population is
known, so that the items available are (a) $q$, the proportion in
the tail, (b) the point of truncation, and (c) the distribution
of the tail measures. In this case the fitting of an assumed
normal distribution is very simple. Let $m$ = the mean of
the tail measured from the stump; let $D$ = the distance from
the mean of the total distribution to the stump; let $\sigma$ = the
standard deviation of the total distribution; and let $x$ and $z$
have the values of Table K-W when entered with the argument
$q$. We then have, from formula [53]


n=2,orD=x0 Me eat payers Mee SOO 
Bet Uy ee LS Ae eee [299] 
qd o o z 

——* 

q 


Solving these two equations for $\sigma$ and $D$ completes the problem.

As an illustration of the use of Table LXVIII we may calculate,
from the data of Table LXIX, the constants of the total
grade distribution of 14-year olds knowing the grade distribution
of the portion found in the elementary school. The
children represented range from 13.5 to 14.5 years of age.
We will assume that the total grade distribution is normal and
that the elementary school portion is a sharply truncated tail,
though in case the compulsory school attendance law applies
only to the elementary school this assumption is undoubtedly
in error, leading to a larger estimate of the number in the
high school than would actually be found there. In the grade
scale used, 3.0 means the beginning of the third grade, 3.25
the middle of the low third, 3.75 the middle of the high third,
etc.

TABLE LXIX 


Grade Distribution of 14-Year Olds Obtained from Certain Virginia 
Survey Data 


GRADE 3.25 3-75 4.25 4.75 5-25 5-75 6.25 6.75 7.25 7-75 8.25 8.75 Total 
NUMBER 
ones nt 2D A % Wey sei Kore Wey By Gly WO) Byh “bie 


The point of truncation is 9.00. Calculation gives

Mdn measured from 9.00 = -1.685
M measured from 9.00 = -1.835
Mdn / M = .9181


From Table LXVIII, q = .7975. This proportion is repre- 
sented by 411 pupils, so that the number in the untruncated or 
total distribution is 515 pupils. The standard deviation of 
the total distribution is, by formula [299], equal to 1.521 grades, 
and D, the distance from the stump to the mean of the total 
distribution, is found by formula [298] to equal 1.266 grades. 
Accordingly the constants of the untruncated distribution of 
fourteen-year olds are 

Mean grade = 7.734 

Standard deviation = 1.521 grades 

Population = 515 pupils 


Section 86. CORRELATION DETERMINED FROM MUTILATED 
DISTRIBUTIONS 


The ability to determine the constants of a total distribution 
from a known fraction of it may be turned to practical account 
in decreasing the size of populations necessary for an assigned 
accuracy. The procedure may be illustrated by a problem, 
the data for which have been kindly supplied by Miss Margaret
V. Cobb.

TABLE LXX


Numbers of Pupils Obtaining Designated Scores upon a Symbol-Digit 
Substitution Test 


ScHoOoL GRADES 
TEST SCORES 


4-25 4-75 8.25 8.75 

105 4 6 
100 4 5 

95 I 3 

90 7 3 

85 5 I 

80 I 4 

if 3 I 2 

70 I I 

65 I 

60 2 I 

Sy) 4 3 I 

50 3 2 

45 4 2 

40 3 I 

30 2 

30 2 

25 I I 

20 I 

28 14. 25 21 N = 88 


The problem which we will set is, in outline, to (a) calculate $r$
from this mutilated table, (b) determine $R$, the correlation to
be expected in a range of two grades, let us say the fifth and
sixth, (c) determine the probable error of $R$ as thus found,
(d) determine the probable error of an $R$ of the same size (designated
$R'$) if found from a population of the same size in grades
5 and 6, and (e) by comparing the reliability of $R$ and $R'$
endeavor to ascertain whether an artificial selection of original
data will decrease the populations necessary to secure a desired
reliability.

Letting school grade be the first variable, and test score the
second, we find $r_{12} = .827$.

If we can determine $\Sigma_1/\sigma_1$, where $\Sigma_1$ is the standard deviation
of the 5 and 6 grade distribution, and $\sigma_1$ that of the 4 and 8
grade distribution, we may use formula [186] to obtain $R$.
Assuming that there are the same number, $f$, of pupils in each
grade we have the two following distributions:

4 and 8 grade    Grades        4.25  4.75  8.25  8.75
distribution     Frequencies     f     f     f     f      giving $\sigma_1^2 = 4.0625$

5 and 6 grade    Grades        5.25  5.75  6.25  6.75
distribution     Frequencies     f     f     f     f      giving $\Sigma_1^2 = .3125$

from which the ratio $\Sigma_1/\sigma_1 = .27735$. Having this ratio and
$r_{12}$ we find by formula [186] that $R = .378$. Thus the correlation
in a two grade range is rather low.
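The figures of this paragraph may be retraced as follows. Formula [186] is not reproduced in this section, so the sketch assumes the usual relation between correlations obtained in two ranges of the first variable, $R = r\rho / \sqrt{1 - r^2 + r^2\rho^2}$ with $\rho = \Sigma_1/\sigma_1$; that this is the form of [186] is an assumption, though it yields the value .378 given in the text. The function name is hypothetical.

```python
from math import sqrt
from statistics import pstdev

# Grade values with equal frequencies in each grade, as assumed in the text.
wide = [4.25, 4.75, 8.25, 8.75]     # 4 and 8 grade distribution
narrow = [5.25, 5.75, 6.25, 6.75]   # 5 and 6 grade distribution
ratio = pstdev(narrow) / pstdev(wide)

def r_in_other_range(r, ratio):
    """Correlation expected when the standard deviation of the first variable
    is changed in the given ratio (assumed form of formula [186])."""
    return r * ratio / sqrt(1 - r * r + r * r * ratio * ratio)

r_wide = 0.827
r_narrow = r_in_other_range(r_wide, ratio)
print(round(ratio, 5), round(r_narrow, 3))
```

Applying the function to the result with the reciprocal ratio returns the original .827, so the relation may be used in either direction.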

By formula [108 b], $\sigma_r = .317/\sqrt{N}$, but this is too small a
value, as the distributions with which we are working are far
from mesokurtic. Estimating the $\beta_2$'s for the school grade and
the test score distributions to be 1.06 and 1.94 respectively
gives by formula [108 a], $\sigma_r = .515/\sqrt{N}$, which is the preferable
value in the case of this platykurtic correlation surface.

If the assumption of form of grade distribution can be made
with great certainty, so that we may consider no error to enter
into the ratio $\Sigma_1/\sigma_1$, we may obtain the standard error of $R$
knowing that of $r$. Starting with formula [187] and taking logarithmic
differentials we have,


Substituting these values for $dk$ and $dK$, squaring, summing,
dividing by the population, and extracting the square root,
gives

$$\frac{\sigma_r}{r\,k^2} = \frac{\sigma_R}{R\,K^2}$$

or

$$\sigma_R = \sigma_r\,\frac{R\,K^2}{r\,k^2}$$
(Standard error of the correlation coefficient
inferred from a coefficient obtained
in a different range) . . . . [300]

Using this formula we find for the data in hand,

$$\sigma_R = 1.2387\,\sigma_r = .638/\sqrt{N}$$ . . . . (a)
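The multiplier in (a) follows directly from formula [300]. The sketch below assumes the rounded values $r = .827$ and $R = .378$ and the standard error $\sigma_r = .515/\sqrt{N}$ found above; the function name is hypothetical. Because of rounding, the multiplier comes out near, not exactly at, the printed 1.2387.

```python
def se_multiplier(r, R):
    """Ratio sigma_R / sigma_r by formula [300], where k^2 = 1 - r^2 and K^2 = 1 - R^2."""
    k_squared = 1 - r * r
    K_squared = 1 - R * R
    return (R * K_squared) / (r * k_squared)

r, R = 0.827, 0.378
multiplier = se_multiplier(r, R)

# Numerator of sigma_R when sigma_r = .515 / sqrt(N).
numerator_sigma_R = multiplier * 0.515
print(round(multiplier, 3), round(numerator_sigma_R, 3))
```

When $R = r$ the multiplier is unity, as it should be, since no change of range is then involved.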


Had the correlation been directly determined from the 5 and
6 grade distribution, its value would presumably be about the
same, $R' = .378$, but its standard error would have been
different. Estimating the $\beta_2$'s to be 2.1 and 3.0, instead of
1.06 and 1.94, as above, the standard error by formula [108 a] is

@ pe OTST VON 12 cx sas ence nokos ak 


Choosing such an N for formula (b) as to result in the same 
standard error as given by formula (a) shows that 1.87 N are 
needed in the narrow 5 and 6 grade calculation to obtain an 
equally reliable result to that deduced for these grades by the 
4 and 8 grade calculation based on N. 

One cannot generalize and say that, given equal populations, 
more reliable results are always obtained from the wider range 
determination, but this is true if correlations are low in the
narrow range and not very high in the wide range, say
under .40 in the former and not over .70 in the latter. If
entire freedom in choosing the range of talent to be examined 
is present, excellent results may be expected if a fairly meso- 
kurtic distribution, yielding a correlation between .60 and ay Loe 
can be selected, and then estimating the correlation for greater 
and lesser ranges by formula [186]. 


Section 87. THE PROBABLE ERROR OF PERCENTAGE MEASURES
OF OVERLAPPING 


The probable error of the proportion in one distribution
which exceeds or falls short of a certain percentile in a second
distribution is a function of both distributions. Let the
constants of the first distribution (to the right in the accompanying
figure) be designated by lower case letters and those of the
second distribution by capitals. Let p = the proportion of
the first distribution falling short of the percentile X of the
second distribution. A change in p may be produced either
by a change in X or by a change in the proportion in the first
distribution below an assigned point.

[Figure: two overlapping frequency curves, labeled "1st distribution" (to the right) and "2nd distribution".]

Let $\delta$ = a small change in the proportion p due to fluctuation
in the second distribution.
Let d = a small change in the proportion p due to fluctuation
in the first distribution.
Let $\Delta$ = a small change in the proportion p due to fluctuations
in both distributions.
Then $\Delta = \delta + d$, and $\sigma_\Delta$, identical with $\sigma_p$, is the standard
error desired.
$$\Sigma\Delta^2 = \Sigma\delta^2 + \Sigma d^2 + 2\,\Sigma\delta d$$
Since $\delta$ and d are functions of two independent distributions
they are uncorrelated and $\Sigma\delta d = 0$, so that

$$\sigma_p^2 = \sigma_\delta^2 + \sigma_d^2$$
. . . [301]


$\sigma_d$ is the standard deviation of the proportion of measures in
the first distribution below the point X and by formula [40]
$\sigma_d = \sqrt{pq/n}$.

If the ordinate of the first distribution per unit base at the 
point X is f and if the distribution is assumed sufficiently flat at 
this point that a small change to the right in X would pass over 
approximately the same number of cases as an equal change 
to the left, then a small change D in X causes a change of fD 
in the number of cases, np, of the first distribution lying below 
the point X. Dealing with proportions, p is affected to the 
extent fD/n. In consequence, 


$$\delta = \frac{fD}{n}$$




In this equation f and n are constants, for we are considering
fluctuation due to variability in the second distribution, so that

$$\sigma_\delta = \frac{f}{n}\,\sigma_D$$
(See problem 7, Chapter 4)

$\sigma_D$ is simply the standard error of a percentile. We have by
formula [42], letting P = the proportion of the second distribution
determining the point X; $Q = 1 - P$; $i_P$ = the number
of units in the class interval in which X lies; $f_P$ the frequency
of this class; and N the population of the second
distribution;

$$\sigma_D^2 = \frac{i_P^2\,NPQ}{f_P^2}$$

Making the proper substitutions in [301] results in

$$\sigma_p^2 = \left(\frac{f^2}{n^2}\right)\left[\frac{i_P^2\,NPQ}{f_P^2}\right] + \left(\frac{pq}{n}\right)$$

(Square of the standard error of the proportion of a distribution
falling short of or exceeding an assigned
percentile of a second distribution) . . . [302]

Note that in this formula the constants in ( ) refer to the first
distribution and those in the [ ] to the second distribution.

If the proportion exceeding the median of the second distribution
is being determined, P = Q = 1/2; and if, further, the
second distribution is normal, $f_P/i_P = .3989N/\Sigma$, in which $\Sigma$
is the standard deviation of the second distribution, so that

$$\sigma_p^2 = \frac{1.5708\,f^2\,\Sigma^2}{n^2 N} + \frac{pq}{n}$$

(Square of the standard error of the proportion of a distribution
falling short of or exceeding the median
of a second and normal distribution) . . . [303]

In case both distributions are normal and have the same
populations and standard deviations, Table LXXI when multiplied
by $1/\sqrt{N}$ gives the standard errors, in the second column,
and the probable errors, in the third column, for different values
of p.

In illustration of the use of Table LXXI the following problem
is given: In a certain fifth grade only 40 per cent of the
pupils exceed in a reading test 50 per cent of the fourth grade.
We will assume the same number of pupils, 36, in each grade.
What are the chances that the true test ability of the fifth
grade is above that of the fourth grade? Referring to Table
LXXI we find that the standard error of the proportion, .40, is
$(.689/\sqrt{36} =)$ .115. Thus the difference between the obtained
proportion and the proportion in case of equally able
classes, namely .10, is (.10/.115 =) .87 standard errors. Entering
Table K-W with x = .87 we obtain q = .19, or, in other
words, the chances are 19 in 100 that the fifth grade ability is
in truth as great as that of the fourth grade.


TABLE LXXI

PROPORTION LYING BELOW    √N × THE STANDARD ERRORS OF      √N × THE P. E.'s
OR ABOVE MEDIAN OF        THE PROPORTIONS OF ONE
SECOND DISTRIBUTION       DISTRIBUTION BELOW, OR ABOVE,
                          THE MEDIAN OF A SECOND

        .001                        .032                         .022
        .01                         .105                         .071
        .02                         .153                         .103
        .05                         .252                         .170
        .10                         .372                         .251
        .15                         .462                         .311
        .20                         .532                         .359
        .25                         .588                         .396
        .30                         .622                         .426
        .35                         .666                         .449
        .40                         .689                         .465
        .45                         .703                         .474
        .50                         .707                         .477

If, for this same problem, fourth and fifth grade means are 
calculated and the probable error of the difference between 
means found by formula [140] we will finally obtain the result 
that there are 14.4 chances in 100 that the fifth grade ability
is in truth as great as that of the fourth grade. Thus slightly 
more definite results may be obtained by finding the differences 
between means instead of the percentage of overlapping. 
Formula [166] of Section 59 provides the correction for the 
error in a measure of overlapping due not as here to size of 
population but to inaccuracy in the instrument of measurement. 


Section 88. A CRITERION FOR THE ADDITION OR ELIMINATION
OF ELEMENTS HAVING FIXED WEIGHTINGS


In many trade, education, and intelligence tests, and in
combining stock quotations to determine general trends, it is
frequently required, because of the necessity for maintaining
simplicity of procedure, to include an item in a composite at a
given weight, or to reject it in toto, i.e., no adjusting of the
weight to the importance of the item is possible. A criterion
for the inclusion or rejection of an item is needed for the handling
of this problem.

To make the problem specific let us suppose that a questions, 
each scored right or wrong, are being evaluated with reference 
to their excellence as a ten-year-old general intelligence test 
battery (such, for example, are the Binet type of questions). 
The correlations of each of the a questions with an independent 
general intelligence measure and the intercorrelations between 
the questions constitute the requisite basic data. Having 
these and using the weights that are imposed, calculate correlations
exactly as in the row labeled “$r_{0(c-i)}$” in Table LXIV.
The highest of these correlations locates the question which
contributes least. This question may be discarded and the
process repeated with the (a − 1) remaining questions, etc.,
until the number desired for the final battery are left. At each
step in this process a comparison of the $r_{0(c-i)}$ correlation
with the $r_{0c}$ correlation shows how much loss, if any, in multiple
correlation results from discarding the question, thus making
available all the information pertinent to the problem.

All correlations should be by the usual product-moment
method, even though but two degrees of merit are possible. 
For the intercorrelations formula [214] may be used. 
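
The elimination procedure of this section may be sketched as follows. The sketch is not the calculation of Table LXIV: it assumes every question is in standard measure and carries the fixed weight 1, so that the correlation of the criterion with the composite is the sum of the criterion correlations divided by the square root of the sum of all intercorrelations (the unit diagonal included). All names and all correlations shown are hypothetical.

```python
from math import sqrt


def composite_correlation(r0, R, items):
    """Correlation of the criterion with the unit-weighted sum of `items`.

    r0[i]   : correlation of question i with the criterion (assumed data)
    R[i][j] : intercorrelation of questions i and j, with R[i][i] = 1
    """
    numerator = sum(r0[i] for i in items)
    variance = sum(R[i][j] for i in items for j in items)
    return numerator / sqrt(variance)


def eliminate(r0, R, keep):
    """Discard, one at a time, the question whose omission leaves the highest
    correlation between criterion and composite, until `keep` remain."""
    items = list(range(len(r0)))
    history = []  # (question dropped, correlation before, correlation after)
    while len(items) > keep:
        r_with_all = composite_correlation(r0, R, items)
        trials = {
            i: composite_correlation(r0, R, [j for j in items if j != i])
            for i in items
        }
        drop = max(trials, key=trials.get)  # highest r locates the least useful question
        history.append((drop, r_with_all, trials[drop]))
        items.remove(drop)
    return items, history


# Hypothetical correlations for four questions.
r0 = [0.50, 0.45, 0.10, 0.40]
R = [
    [1.0, 0.3, 0.1, 0.2],
    [0.3, 1.0, 0.1, 0.6],
    [0.1, 0.1, 1.0, 0.1],
    [0.2, 0.6, 0.1, 1.0],
]
kept, history = eliminate(r0, R, keep=2)
print(kept, [(i, round(a, 3), round(b, 3)) for i, a, b in history])
```

Comparing the two correlations recorded at each step shows what is lost, or gained, by the discard, which is the comparison the section calls for.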


Section 89. TRADE TEST CALIBRATION
A procedure of evaluation, or, “calibration,” of trade test 
questions, based upon the slope of an ogive curve, has been 
practiced by the Army Trade Test staff. As an illustration 
let us suppose questions A, B, C, and D have been correctly 
answered by varying proportions of unskilled and skilled arti- 
sans as shown in the following Table: 


TABLE LXXII
Percentages Answering Correctly


                 NOVICES   APPRENTICES   JOURNEYMEN   EXPERTS
Question A . .     10          14            18          24
Question B . .      2           2            51          60
Question C . .     20          62            70          75
Question D . .      2           1            14          54




I have elsewhere (Kelley, 1916, simp.), pointed out, in the 
case of an ogive curve in which the abscissa is a scale of diffi- 
culty, and the ordinate per cent correct responses, that un- 
correlated questions of the difficulty corresponding to the point 
of steepest slope result in more accurate determinations of 
ability than a similar number of questions of a different diffi- 
culty. The principle is clearly general, and can be used to 
scale a question given subjects of known differences in ability 
just as, in reverse, it can be used to determine proficiency 
when given scaled questions. Thus, if ogive curves, the abscissa
being novice-apprentice-journeyman-expert and the ordinate per
cent correct responses, be plotted for each question, the steepest
part of the curve will lie between the two groups most
decisively differentiated from each other by the question.

Inspection shows that question A is not satisfactory either 
as an apprentice, journeyman, or expert question; that ques- 
tion B is an excellent journeyman question; C an excellent 
apprentice question; and D a good expert question. So far 
as determining the trade group with reference to which a 
single question will be of most value the method is excellent, 
but it falls short, as will every method not involving intercor- 
relations, of what is to be desired in a method used to select 
a battery of questions. A combination of this procedure with 
that of the previous section should give good results. 
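
The inspection just described amounts to finding, for each question, the steepest rise between adjacent trade groups. A minimal sketch follows, using the percentages as read from Table LXXII; the names are illustrative, and equal spacing of the four groups along the abscissa is an assumption.

```python
GROUPS = ["novices", "apprentices", "journeymen", "experts"]

# Percentages answering correctly, as read from Table LXXII.
PERCENT_CORRECT = {
    "A": [10, 14, 18, 24],
    "B": [2, 2, 51, 60],
    "C": [20, 62, 70, 75],
    "D": [2, 1, 14, 54],
}


def steepest_step(percents):
    """Return the group reached by the steepest rise and the size of that rise."""
    rises = [later - earlier for earlier, later in zip(percents, percents[1:])]
    best = max(range(len(rises)), key=lambda i: rises[i])
    return GROUPS[best + 1], rises[best]


calibration = {q: steepest_step(p) for q, p in PERCENT_CORRECT.items()}
for question, (group, rise) in calibration.items():
    print(question, group, rise)
```

Question A shows no rise greater than 6 points, in keeping with the judgment that it is unsatisfactory for any group.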


Section 90. THE DETERMINATION OF THE CROSS-OVER VALUE
OF A CHROMOSOME SECTION *


In the following treatment certain terms will be used with
meanings which may be made clear by an example: If a fly 
showing two mutant characters, black and vestigial, is crossed 
to a fly showing neither of these characters, then in the back 
cross progeny the characters will reappear in the original
combinations, namely black vestigial or not-black not-vestigial, 
in the majority of cases, but small classes of progeny will occur 
that are recombinations of the original characters, namely, 
they are black not-vestigial, or not-black vestigial flies. 


* I am indebted to Dr. Calvin B. Bridges for the biological statement of this problem. 




To explain the occurrence of these recombinations it is 
assumed that crossing-over occurs in the section of the chromo- 
some between the loci at which the genes for these characters 
are situated. The gene responsible for the development of 
the character black is situated in a rod-like body called a 
chromosome at a definite point which is the black locus. Like- 
wise the gene for vestigial is situated in that same chromosome, 
the “second,” at a locus some distance to the right of that of
black. The second chromosome is represented twice in every 
cell — by the chromosome from the mother carrying the genes 
for black and vestigial, and by the chromosome from the 
father carrying in these loci the gene for not-black and the 
gene for not-vestigial. In the production of eggs these two 
chromosomes, A and A’, come to lie side by side and homolo- 
gous sections are interchanged by crossing-over. Both chro- 
mosomes break in two at a corresponding point and the left 
part of A joins to the right part of A’ and vice-versa. The 
cross-over occurs at random along the chromosome. When- 
ever one occurs between the loci of black and vestigial, a 
black not-vestigial and a not-black vestigial chromosome are 
produced and these give rise to the character recombinations. 
However, two occurrences of crossing-over may take place 
coincidentally between these loci and not be detected as a 
recombination of the characters. Again, if three cross-overs 
take place between these loci only simple recombination is 
observed. Accordingly, unless the section is so short as to 
preclude double crossing-over, the number of recombinations 
is always less than the number of cross-overs. 

The first problem of the student of this subject is to determine 
the number of cross-overs from the number of recombinations. 
This problem offers certain difficulties, but for our present 
problem we will assume it solved by an equation of the type 


$$100\,n = 100\,(R + 2d)$$
(The cross-over value of a chromosome section) . . . [304]


in which R is the proportion of recombinations observed to 
take place, d the proportion of double (plus occasional triple), 
cross-overs, expected from previous determinations in this 
general chromosome region, when the proportion of recombina- 




tions is R, and 100 n is the cross-over value of the section 
studied as given by the experiment. 

The second problem is the determination of the reliability of 
the cross-over value determination. This offers genuine 
statistical difficulties due to the variability in the ratio d/R 
for different lengths of chromosome and for different general 
regions in the chromosome. I offer the following as an empiri- 
cal formula, which I believe will not be far from the mark, at 
least as long as uncertainty as to the ratio d/R persists: 

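$$100\,\sigma_n = 100\,(\sigma_R + 2\,\sigma_d)$$
(Empirical formula for the standard error of the cross-over value) . . . [305]


in which $\sigma_R$ is defined by the equation

$$\sigma_R = \sqrt{\frac{R\,(1 - R)}{N}}$$

and $\sigma_d$ by the equation

$$\sigma_d = \sqrt{\frac{d\,(1 - d)}{N}}$$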


N in each case being the total number of flies in the experiment. 
Having, either by means of formula [305] or otherwise, an 
estimate of the standard error of a single cross-over value 
determination, we come to the third problem, which is: 

The utilization of several direct and indirect independent 
determinations of the length of the same chromosome section 
to arrive at the most probable value. 
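
Formulas [304] and [305] may be put into a few lines for a single experiment. The sketch assumes $\sigma_R = \sqrt{R(1-R)/N}$ and $\sigma_d = \sqrt{d(1-d)/N}$; the function names and the numbers (R = .18, d = .01, N = 1000 flies) are hypothetical.

```python
from math import sqrt


def cross_over_value(R: float, d: float) -> float:
    """Formula [304]: 100 n = 100 (R + 2 d)."""
    return 100.0 * (R + 2.0 * d)


def cross_over_standard_error(R: float, d: float, N: int) -> float:
    """Empirical formula [305]: 100 sigma_n = 100 (sigma_R + 2 sigma_d)."""
    sigma_R = sqrt(R * (1.0 - R) / N)
    sigma_d = sqrt(d * (1.0 - d) / N)
    return 100.0 * (sigma_R + 2.0 * sigma_d)


# Hypothetical experiment: 18 per cent recombinations, 1 per cent expected
# double cross-overs, 1000 flies.
value = cross_over_value(0.18, 0.01)
error = cross_over_standard_error(0.18, 0.01, 1000)
print(round(value, 2), round(error, 3))
```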

Let $100\,n_{12}$ = an experimentally determined cross-over value
between loci 1 and 2, and let $100\,\sigma_{12}$ = its standard error;
and similarly for n's and $\sigma$'s with other subscripts.

If a number of loci in order are $X_1, X_2, X_3, X_4$ and if different
experiments have been conducted so that there are separate
determinations of (a) $n_{13}$, (b) $n_{12}$, and (c) $n_{23}$, the problem then
is to use these three determinations to arrive at the most
reliable value for the distance between $X_1$ and $X_3$. We will
call this most reliable value $\bar n_{13}$. We have two determinations
of the same distance, namely, $n_{13}$ and $(n_{12} + n_{23})$. The standard
error of $n_{13}$ is $\sigma_{13}$, and since $n_{12}$ and $n_{23}$ are independent
determinations they are uncorrelated and the standard error
of $(n_{12} + n_{23})$ is $\sqrt{\sigma_{12}^2 + \sigma_{23}^2}$. To average these two distances
so as to secure a distance with the minimal standard error we
must weight each inversely as the square of its standard error
as proven in the next section, formula [307]. Accordingly,

$$\bar n_{13} = \frac{\dfrac{n_{13}}{\sigma_{13}^2} + \dfrac{n_{12} + n_{23}}{\sigma_{12}^2 + \sigma_{23}^2}}{\dfrac{1}{\sigma_{13}^2} + \dfrac{1}{\sigma_{12}^2 + \sigma_{23}^2}}$$

Should there be a third independent means of determination,
e.g., were independent values of $n_{14}$ and $n_{34}$ available, the procedure
would be similar, giving

$$\bar n_{13} = \frac{\dfrac{n_{13}}{\sigma_{13}^2} + \dfrac{n_{12} + n_{23}}{\sigma_{12}^2 + \sigma_{23}^2} + \dfrac{n_{14} - n_{34}}{\sigma_{14}^2 + \sigma_{34}^2}}{\dfrac{1}{\sigma_{13}^2} + \dfrac{1}{\sigma_{12}^2 + \sigma_{23}^2} + \dfrac{1}{\sigma_{14}^2 + \sigma_{34}^2}}$$

(The best value for a distance scaled in three independent ways: (a)
in toto, (b) in parts, (c) in parts) . . . [306]


Any further number of independent determinations may be 
utilized in the same manner. It may happen that the number 
of possible means of determination is so great as to make the 
labor of utilizing all of them excessive, in which case certain 
clearly defined loci preferably between 30 and 40 units apart
may be carefully determined using all the data and other 
points located with reference to them using data between two 
loci already scaled. 
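
Formula [306] may be illustrated numerically. In the sketch below each indirect determination is first reduced to a single value with its standard error (the square root of the sum of the squared standard errors of its parts), and the several determinations are then weighted inversely as the squares of their standard errors. The helper names and the cross-over values are hypothetical; the standard error reported for the combined value, $1/\sqrt{\Sigma(1/\sigma^2)}$, is the usual result for independent measures and is not stated in the text.

```python
from math import sqrt


def chained(value_a, se_a, value_b, se_b, sign=+1):
    """Sum (sign=+1) or difference (sign=-1) of two independent determinations."""
    return value_a + sign * value_b, sqrt(se_a ** 2 + se_b ** 2)


def combine(determinations):
    """Weight (value, standard error) pairs inversely as the squared errors."""
    weights = [1.0 / se ** 2 for _, se in determinations]
    total = sum(weights)
    value = sum(w * v for w, (v, _) in zip(weights, determinations)) / total
    return value, sqrt(1.0 / total)


# Hypothetical cross-over values (in units of 100 n) and standard errors.
direct = (20.0, 1.0)                                  # n13 in toto
by_sum = chained(12.0, 0.6, 9.0, 0.8)                 # n12 + n23
by_difference = chained(35.0, 1.2, 14.5, 0.9, sign=-1)  # n14 - n34

best_n13, best_se = combine([direct, by_sum, by_difference])
print(round(best_n13, 3), round(best_se, 3))
```

The combined standard error is smaller than that of any single determination, which is the purpose of using all the data.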


Section 91. THE BEST WEIGHTED AVERAGE OF INDEPENDENT
VARIABLES


To complete the proof of the preceding section it remains 
to establish the theorem that the best weighted average of n 
independent measures of the same magnitude is that obtained 
by weighting each inversely as the square of its standard error. 

We will first prove it for two variables, $a_1$ and $a_2$, having
standard errors $\sigma_1$ and $\sigma_2$. It is required to so distribute the
total weight of 1.00 between $a_1$ and $a_2$ that the standard error,
$\sigma_{\bar a}$, of the weighted average, $\bar a$, shall be a minimum. Let
the weights be $w_1$ and $w_2$. We have

$$w_1 + w_2 = 1$$
$$\bar a = w_1 a_1 + (1 - w_1)\,a_2$$
$$\sigma_{\bar a}^2 = w_1^2\sigma_1^2 + (1 - w_1)^2\sigma_2^2 + 2\,w_1(1 - w_1)\,\sigma_1\sigma_2\,r_{12}$$

in which $r_{12}$ is equal to zero as $a_1$ and $a_2$ are by hypothesis
independent measures, so that

$$\sigma_{\bar a}^2 = w_1^2\sigma_1^2 + \sigma_2^2 - 2\,w_1\sigma_2^2 + w_1^2\sigma_2^2$$



Differentiating with respect to $w_1$, setting the derivative equal
to zero, and solving for $w_1$ and $w_2$ results in

$$\frac{w_1}{w_2} = \frac{1/\sigma_1^2}{1/\sigma_2^2}$$

$$\bar a = \frac{\dfrac{a_1}{\sigma_1^2} + \dfrac{a_2}{\sigma_2^2}}{\dfrac{1}{\sigma_1^2} + \dfrac{1}{\sigma_2^2}}$$
(Best weighted average of two independent measures) . . . [307]

If a third variable which is independent of the first two is
included it cannot change the best relative weightings of the
first two, therefore

$$\frac{w_1}{w_2} = \frac{1/\sigma_1^2}{1/\sigma_2^2}$$
. . . (a)

must still hold, and by parity

$$\frac{w_1}{w_3} = \frac{1/\sigma_1^2}{1/\sigma_3^2}$$
. . . (b)

$$\frac{w_2}{w_3} = \frac{1/\sigma_2^2}{1/\sigma_3^2}$$
. . . (c)

Further, the sum of the weights must equal 1, that is,

$$w_1 + w_2 + w_3 = 1$$
. . . (d)

By inspection it is seen that the weights in the following equation
meet these four conditions:

$$\bar a = \frac{\dfrac{a_1}{\sigma_1^2} + \dfrac{a_2}{\sigma_2^2} + \dfrac{a_3}{\sigma_3^2}}{\dfrac{1}{\sigma_1^2} + \dfrac{1}{\sigma_2^2} + \dfrac{1}{\sigma_3^2}}$$
(Best weighted average of three independent measures) . . . [308]

Having four conditions to meet and but three weights this
solution is unique. It is obvious from the steps involved that
the proof may be extended to cover any number of variables,
so that in general

$$\bar a = \frac{\dfrac{a_1}{\sigma_1^2} + \dfrac{a_2}{\sigma_2^2} + \cdots + \dfrac{a_n}{\sigma_n^2}}{\dfrac{1}{\sigma_1^2} + \dfrac{1}{\sigma_2^2} + \cdots + \dfrac{1}{\sigma_n^2}}$$
(Best average of n independent variables) . . . [309]
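
The two-variable case can be verified numerically. The sketch below, with hypothetical standard errors of 2 and 1, compares the weight given by the theorem with the weight found by trying every value of $w_1$ from 0 to 1 in steps of .001; all names are illustrative.

```python
def variance_of_average(w1: float, sigma_1: float, sigma_2: float) -> float:
    """Squared standard error of w1*a1 + (1 - w1)*a2 for independent measures."""
    return w1 ** 2 * sigma_1 ** 2 + (1.0 - w1) ** 2 * sigma_2 ** 2


def best_weight(sigma_1: float, sigma_2: float) -> float:
    """Weight of the first measure when each is weighted inversely as the
    square of its standard error."""
    return (1.0 / sigma_1 ** 2) / (1.0 / sigma_1 ** 2 + 1.0 / sigma_2 ** 2)


sigma_1, sigma_2 = 2.0, 1.0          # hypothetical standard errors
w_theory = best_weight(sigma_1, sigma_2)

# Brute-force check: scan a grid of weights and keep the one of least variance.
grid = [i / 1000 for i in range(1001)]
w_search = min(grid, key=lambda w: variance_of_average(w, sigma_1, sigma_2))

print(w_theory, w_search, variance_of_average(w_theory, sigma_1, sigma_2))
```

The less reliable measure receives the smaller weight, and the variance of the weighted average falls below that of the better measure taken alone.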




Section 92. PSYCHOPHYSICAL METHODS


The excellent treatment of the statistical processes involved 
in the handling of the various psychophysical methods given in 
Brown and Thomson (1921) makes an exhaustive treatment 
here unnecessary; however, the very important process of 
fitting smooth curves to data collected by the “constant 
method,” or the ‘‘method of right and wrong cases,’ is treated 
of in connection with Fechner’s fundamental table of the 
normal probability integral. Table K-W is so much more
serviceable in this connection, both because of the type of 
entry which it contains and because of the greater accuracy 
which it permits, that the process is herewith described in full. 

When successive stimuli, $s_1, s_2, s_3, \cdots s_m$, are each compared
a number of times, $N_1, N_2, N_3, \cdots N_m$, with a constant stimulus,
k, and the subject is required to act in each case by calling the
stimulus greater than or less than the constant stimulus, there
results a progression of proportions, $p_1, p_2, p_3, \cdots p_m$, giving
the proportion of times that each stimulus is considered greater
than the standard of comparison, k. If the smallest of the
variable stimuli is much smaller than k and the largest is much
larger, the proportions will run from .00 to 1.00 and if plotted
will give an ogive curve. If the smallest and largest stimuli
are not sufficiently different from k to lead to proportions of
.00 and 1.00 at the extremes, some reasonable assumption as
to the distribution of these tail measures must be made. From
the general nature of the ogive curves found in psychological
data obtained as described, it has been surmised that the
integral of a normal curve may ordinarily be taken as well
representing the distribution of proportions in the tails as well
as in the more central portion of the curve.

The problem is, therefore, to fit a curve of the type

$$p = \frac{1}{\sigma\sqrt{2\pi}} \int_{-\infty}^{s} e^{-\frac{(t - \bar s)^2}{2\sigma^2}}\, dt$$

to the observed data. The magnitude $\bar s$ is that stimulus at
which p equals 1/2. It is not an observed value of s, but is to be
determined from all the data; s is any one of the variable
stimuli; $\sigma$ is the standard deviation, in terms of the units of s,
of the normal distribution of which p is the integral. Accordingly
$(s - \bar s)/\sigma$ is a deviation from the mean expressed in terms
of the standard deviation, that is, it is comparable to x of
Table K-W, and $y\sigma/N$ is comparable to z of that table. Assume
that the ogive curve is the integral of the normal distribution

$$y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-\frac{(s - \bar s)^2}{2\sigma^2}}$$

Each proportion, $p_1, p_2, p_3, \cdots p_m$, is a fraction of the area under
this curve and for each such proportion there is a value $x_1, x_2,
x_3, \cdots x_m$, which may be obtained from Table K-W.

Even if the values of $\bar s$ and $\sigma$ have been determined in the
best possible manner there will still be discrepancies between $x_1$
and $(s_1 - \bar s)/\sigma$; $x_2$ and $(s_2 - \bar s)/\sigma$; etc. due to the best fit ogive
not being a perfect fit. The problem may now be restated in
more specific terms. It is required to determine $\bar s$ and $\sigma$
(in the parlance of psychology, $\bar s$ is the threshold and $\sigma$ is the
dispersion of the measures giving the threshold), so that the
sum of the squares of the deviations, $[x - (s - \bar s)/\sigma]$, shall be
a minimum.

In the early statement of the problem by Fechner and
Müller it was argued that the sum of the squares of the deviations
of the obtained p's from calculated p's should be made a
minimum, but as Urban (1909), (1912) and Thomson (1919
dir.), have shown this is plainly in error and the deviations in
the x's, as indicated above, are the proper ones to treat by the
method of least squares.

For each proportion p there is an x which differs from
$(s - \bar s)/\sigma$ by a certain amount, and the standard error of this
difference is identical with the standard error of the x, for s has
no error in it, being a given stimulus. If, therefore, the standard
errors of $x_1, x_2, x_3, \cdots x_m$ are obtained, we know exactly
what weights to give to the m derivations in arriving at the
best values of $\bar s$ and $\sigma$, for by the theorem of the preceding
section, independent measures of unequal reliability should be
weighted inversely as the squares of their standard errors.

The deviation x is simply a percentile value, and the standard
error of a percentile [43] has been shown to be equal to

$$\frac{\sigma}{z}\sqrt{\frac{pq}{N}}$$

Accordingly the m residuals, $[x_1 - (s_1 - \bar s)/\sigma]$, $[x_2 - (s_2 - \bar s)/\sigma]$,
$\cdots$ must be weighted
$$\frac{N_1 z_1^2}{\sigma^2 p_1 q_1},\quad \frac{N_2 z_2^2}{\sigma^2 p_2 q_2},\quad \cdots$$
respectively. Since $\sigma^2$ is a constant for the entire procedure,
it may be dropped without affecting the relative weightings.
$N_1, N_2, \cdots$ depend upon the particular experiment. The
remainder is $z^2/pq$ and is the product of the entries in the
z/p and z/q columns of Table K-W. Except for the factors
N and $\sigma^2$ these weights are simply the squares of the reciprocals
of the standard errors of successive percentiles of a normal
distribution. The proportional magnitudes, $2\pi z^2/4pq$, are
the weights of Urban's Table. The factor $2\pi/4$ was chosen
by Urban merely in order to make the maximum weight 1.
We may consider two cases in applying these weightings:
First: When neither $\sigma$ nor $\bar s$ is known. In this case the sum
of the squares of the residuals

$$\left(x_1 - \frac{s_1}{\sigma} + \frac{\bar s}{\sigma}\right),\ \left(x_2 - \frac{s_2}{\sigma} + \frac{\bar s}{\sigma}\right),\ \cdots\ \left(x_m - \frac{s_m}{\sigma} + \frac{\bar s}{\sigma}\right)$$

is to be made a minimum, after each has been given its appropriate
weight, $w_1, w_2, \cdots w_m$, as defined by equations [310].

$$w_1 = N_1\frac{z_1^2}{p_1 q_1},\quad w_2 = N_2\frac{z_2^2}{p_2 q_2},\quad \cdots\quad w_m = N_m\frac{z_m^2}{p_m q_m}$$
(Constant method weights) . . . [310]

The magnitudes $z^2/pq$ are readily obtained, being the product of
z/p and z/q of Table K-W. The magnitudes N are the
numbers of cases in the successive experiments. By the usual
method of least squares, the required values of $1/\sigma$ and $\bar s/\sigma$ are
given by the solution of the two following simultaneous equations,
in which $\Sigma$ indicates a summation of m terms:

$$\Sigma wx - \frac{1}{\sigma}\,\Sigma ws + \frac{\bar s}{\sigma}\,\Sigma w = 0$$
. . . [311]
$$\Sigma wxs - \frac{1}{\sigma}\,\Sigma ws^2 + \frac{\bar s}{\sigma}\,\Sigma ws = 0$$
. . . [312]
(Normal equations for threshold and dispersion calculations)


Second: When the chief concern is with the determination
of precision of judgment and $\bar s$ is known without experimental
determination. Such situations may arise in the derivation
of educational and psychological scales, such as drawing or
composition scales where $\bar s$ is taken as equal to K. In this
case equation [311] only is necessary, as $\bar s = K$, a known
quantity. Solving [311] for $\sigma$ we have

$$\sigma = \frac{\Sigma ws - K\,\Sigma w}{\Sigma wx}$$
(Calculation of dispersion when the threshold is known) . . . [311 a]


The following problem illustrates the steps involved in the 
method. The data are drawn from the educational field to 
show the value of psychophysical methods in a much wider 
field than that to which they are usually limited. 

A judge is called upon to rank an English composition as 
better or worse than 40 standard compositions which are
graded on a certain scale of merit. Ten of these forty have 
merit 38, eight have merit 50, six have merit 60, and 16 have 
merit 68. The rankings given by the judge and the calculation 
of the threshold and the dispersion are as follows: 


TABLE LXXIII

MERIT OF   NUMBER OF   NO. SAMPLE    PROPOR-   x FROM          w        wx        ws       wxs      ws²
STANDARD   STANDARDS   IS "BETTER"   TIONS     TABLE K-W
   s       USED, N

   38         10            7         .70      −.524401      5.757   −3.019     218.8   −114.7     8313
   50          8            4         .50       .000000      5.093     .000     254.7       .0    12733
   60          6            3         .50       .000000      3.820     .000     229.2       .0    13752
   68         16            2         .875     1.150349      6.199    7.131     421.5    484.9    28664
                                                            20.869    4.112    1124.2    370.2    63462
                                                              Σw       Σwx       Σws      Σwxs     Σws²


Thus the normal equations are:

$$4.112 - \frac{1}{\sigma}\,1124.2 + \frac{\bar s}{\sigma}\,20.869 = 0$$

$$370.2 - \frac{1}{\sigma}\,63462 + \frac{\bar s}{\sigma}\,1124.2 = 0$$

Their solution gives

$\sigma = 19.55$ and $\bar s = 50.02$
We thus conclude that the integral of a normal distribution 
having a mean of 50.02 and a standard deviation of 19.55 is 
the best fit determination. If the purpose of the investigation 
has been to determine the merit of the sample, we conclude 




that 50.02 is the best estimate of its true merit. The error of 
this value is unknown, but if the standards of comparison have 
been such that proportions, p, not greatly different from .5 
have resulted, the standard error is probably in the neighborhood
of $1.5\sigma/\sqrt{\Sigma N}$. If all the proportions are very large or
very small, the error will be much larger than this. If it is 
known ahead of this calculation that the sample has a certain 
merit, let us say 45, then the calculation shows that the syste- 
matic error of the judge is 5.02 and that his chance error is 
represented by a distribution with standard deviation 19.55. 
Note that systematic error is synonymous with threshold, and 
standard error of judgment with the psychophysical measure 
of dispersion. 
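
The calculation of Table LXXIII can be retraced with a short sketch. It forms the weights of [310], solves the normal equations [311] and [312] for $1/\sigma$ and $\bar s/\sigma$, and returns the threshold and dispersion. Python's `statistics.NormalDist` stands in for Table K-W, and the names are illustrative. One reading of the table is assumed: the x column implies that the proportions fitted are those in which the standard was judged better than the sample, so the counts entered below are 3, 4, 3 and 14.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # unit normal curve


def fit_constant_method(stimuli, trials, times_greater):
    """Weighted least-squares fit of threshold and dispersion.

    times_greater[i] is the number of the trials[i] comparisons in which
    stimulus i was judged greater than the thing compared. No proportion
    may be exactly 0 or 1, since the deviate x is then infinite.
    """
    sw = swx = sws = swxs = swss = 0.0
    for s, n, g in zip(stimuli, trials, times_greater):
        p = g / n
        x = STD_NORMAL.inv_cdf(p)          # deviate x for the proportion p
        z = STD_NORMAL.pdf(x)              # ordinate z at x
        w = n * z * z / (p * (1.0 - p))    # weight, formula [310]
        sw += w
        swx += w * x
        sws += w * s
        swxs += w * x * s
        swss += w * s * s
    # Normal equations [311] and [312] in a = 1/sigma and b = s_bar/sigma:
    #   swx  - a*sws  + b*sw  = 0
    #   swxs - a*swss + b*sws = 0
    a = (swxs * sw - swx * sws) / (swss * sw - sws * sws)
    b = (a * sws - swx) / sw
    return b / a, 1.0 / a                  # threshold s_bar, dispersion sigma


merits = [38, 50, 60, 68]
standards_used = [10, 8, 6, 16]
standard_judged_better = [3, 4, 3, 14]     # assumed reading of the table

threshold, sigma = fit_constant_method(merits, standards_used, standard_judged_better)
print(round(threshold, 2), round(sigma, 2))
```

Because the sketch carries more decimals than the table, the results may differ from 50.02 and 19.55 in the last figure.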


CHAPTER XIII 
INDEX NUMBERS 


Section 93. THE BEARING OF PURPOSE AND MATERIAL UPON
FORM OF INDEX


The discussion in this chapter will be with reference to
price ratios and averages of such ratios, as they are found to
vary from time to time. The treatment does not, however,
necessitate that price and time be the two variables. In
dealing with size of certain organisms in a liquid medium, length
and temperature might be the two variates. Illustrations from 
other fields will be equally obvious. 

In planning the construction of an index number in the field 
of economics three questions are important: (a) What is the 
purpose to be served by the proposed index? (b) What price
and quantity data can be selected or collected to best serve 
this purpose? and, (c) What form of index is the best in the 
light of (a) and (b)? 

(a) Though the chief treatment of this chapter is with (c) 
it should be borne in mind that differences in (a) and (b) can 
conceivably completely change the form of index which is most 
suitable. In particular a problem requiring an index, the mean- 
ing of which can be accurately grasped by a lay audience, 
cannot involve geometric and harmonic means; an index 
which, for the use that is to be made of it, must be reversible 
no matter what year is made the base, cannot be built upon 
quotations of commodities differing from year to year; an 
index which is required to serve the double purpose of being 
equally serviceable whether price relatives or quantity rela- 
tives are sought, cannot be asymmetrical with respect to prices 
and quantities; an index which is designed to picture an 
aggregate condition in an industry, country, or other unit, 



cannot be based upon partial data unless it incorporates pro- 
vision for estimation of omitted material; etc., etc. 

Fisher (1921) has especially stressed the value of an aggre- 
gate index which is both a price and a trade index, permitting 
interpretation as to quantities involved as well as prices paid. 
He implies that an “unbiased” index meeting these conditions,
of which there is more than one, is the index par excellence, 
answering all the essential problems. As to whether this is 
so is a question of economics and only secondarily of mathe- 
matics. For this reason the present treatment stresses this 
feature less than does Fisher. This does not imply a disagree- 
ment with Fisher but rather an indisposition to attempt to 
answer a problem which is in the main economic. 

The number and nature of the commodities entering into an 
index depends upon the degree of accuracy required and the 
particular purpose to be served. They are consequent to the 
form of index used only because certain indexes require both 
price and quantity data while others are less exacting. Having 
determined the form of index, and knowing the purpose, and 
ruling out of consideration the index which is a complete survey 
of a field the question in choosing commodities is, what are the 
principles which should control in drawing a sampling? The 
fundamental principles of multiple correlation apply — high 
correlation with the purpose to be served and low intercorrela- 
tion. If a coal price index is being constructed from a small 
number drawn from a much larger number of quotations, the 
quotations should be chosen so that (a) each is as little cor- 
related as possible with the other quotations included in the 
index, and (b) each is as highly correlated as possible with the 
other quotations in the field not included in the index. It is 
to be expected that commercial tendencies will conspire to 
prevent any quotation from markedly possessing both char- 
acteristics, in which case a balance must be struck between 
them: (b) is the more important if the number of quotations 
in the index is small, say not over six, but (a) is by far the 
more important if the number of quotations is large. In fact, 
quotations that are excellent for incorporation in an index 
number based upon a small number of items may be expected 
to be relatively inferior for incorporation in an index based 




upon a large number of items. This brief observation as to 
the significance of correlation between commodity prices is, 
in the main, an addendum to, not in opposition to, the points 
involved in Mitchell’s (1915) very thorough exposition of 
the question of what commodities should be included. 

The preceding paragraphs merely touch upon the various 
phases of the problem of purpose and selection of material.
No one source covers this adequately, but the reader will find 
a fairly complete treatment of all phases of the problem in 
the following selected list of references: Edgeworth (1896) 
and (1887, 88, 89, 90), Fisher (1913) and (1921), Knibbs 
(1912), Mitchell (1915), Pearson (1910, const.) and (1911, ops.), 
Walsh (1901) and (1921). 

The succeeding treatment of topic (c) is taken with some 
modification and abridgment from Kelley (1921, cert.). 


Section 94. THE MEANING OF A PRICE RATIO AND OF A PRICE
INDEX


The price of a commodity in some one year, $p^1_1$ (the superscript
designates the commodity, while the subscript designates
the year), divided by the price of the same commodity
in a second year, $p^1_2$, is $p^1_1/p^1_2$ and is called a price ratio. A
composite of several such ratios purporting to portray a general
relationship between prices in the two years is a price index,
$P_1/P_2$. The fundamental concept in this is the ratio or geometric
concept. Indices can be built upon many bases, but
irrespective of the method of construction, the usual interpretation
will involve this geometric concept. The lay reader
will think that $P_1$ is a certain proportion of $P_2$, and $P_2$ is the
inverse proportion of $P_1$. An index which is not reversible
does not parallel the thought processes inherent in the concept
“price ratio,” and this more elementary concept, where reversibility
is the rule, is the one by means of which “price index”
is interpreted. Even writers who are quite aware that the
index they are using is not reversible, use price ratios and price
indices in such a way that it is obvious they expect the same
sort of concept to be called up in the reader’s mind; for example,
“$p^1_1/p^1_2 = 122$, but $P_1/P_2 = 120$ so that . . . ,” etc.




In so far as the concept $P_1/P_2$ is commonly of a different
nature from $p^1_1/p^1_2$, it lies in the fact that $P_1$ and $P_2$ are averages,
and $p^1_1$ and $p^1_2$ are single measures. Accordingly, to parallel
customary thinking, $P_1/P_2$ should mean a reversible
proportion between averages. What an “average” is may not be so
definitely established in the minds of scientific people generally
as is the idea “ratio,” but probably the most common concept
is that of arithmetic average or mean. We therefore have
the somewhat anomalous situation of $P_1/P_2$ calling up the
arithmetic concept when dealing with the two separate elements
involved in it, but the geometric concept when dealing with
the thing entire. Since this mixture of concepts seems likely
to persist, the writer proposes as an important test of the
excellence of an index number the closeness with which the
operations involved in it parallel general thinking tendencies:
First and most important, reversibility of ratio, and second,
arithmetic averages involved in the parts.
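The reversibility spoken of in this section may be checked numerically. The following is a minimal sketch in Python, assuming three hypothetical price ratios; the values and the function names `arithmetic_index` and `geometric_index` are illustrative only and are not taken from the text. It compares an unweighted arithmetic mean of ratios with an unweighted geometric mean when the two years are interchanged.

```python
from math import prod


def arithmetic_index(ratios):
    """Unweighted arithmetic mean of price ratios."""
    return sum(ratios) / len(ratios)


def geometric_index(ratios):
    """Unweighted geometric mean of price ratios."""
    return prod(ratios) ** (1.0 / len(ratios))


# Hypothetical ratios p_1/p_2 for three commodities (illustrative values only).
ratios_12 = [2.0, 0.5, 1.0]
# The same comparison with the two years interchanged, p_2/p_1.
ratios_21 = [1.0 / r for r in ratios_12]

arith_12 = arithmetic_index(ratios_12)
arith_21 = arithmetic_index(ratios_21)
geom_12 = geometric_index(ratios_12)
geom_21 = geometric_index(ratios_21)

# A reversible index satisfies i_12 * i_21 = 1.
print("arithmetic:", arith_12 * arith_21)  # greater than 1, so not reversible
print("geometric: ", geom_12 * geom_21)    # 1 to within rounding
```

With these assumed ratios the arithmetic form reports a rise in prices whichever year is taken as base, which is the failure of reversibility the lay reader does not expect.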


Section 95. THE PROBABLE ERRORS OF VARIOUS INDEXES 


That a price index has a probable error is a fact not always
recognized and not entirely obvious, for it may easily happen
that the price ratios are entirely reliable. It may be possible
to say that the price of cotton at a certain time was $p^1_1$ and at
a second time $p^1_2$. If the price quotations are accurate, then
the price ratio $p^1_1/p^1_2$ is a true measure. The average of
several such gives $P_1/P_2$, which is invariable. Therefore,
$P_1/P_2$ has zero probable error as far as being the average of
these particular things, but the very combining of them
involves the assumption that the index has significance beyond
the particular data from which it is calculated. The only
exception would be when $P_1$ and $P_2$ are determined from all
the possible data. As an example, let $p^1_1$ be the price of coal
at a certain mine at the first date, $p^2_1$ the price at a second mine,
. . . , $p^n_1$ the price at the last mine, and similarly for the $p_2$’s.
Then, since all the sources are involved, $P_1/P_2$ is the index of
coal prices and has no probable error, except such as might be
due to faulty quotations and calculations and could therefore,
by proper care, be made negligible.




This is not the typical situation. Ordinarily but a few 
quotations are worked up into an index and the result taken 
as representative of an industry or a field. We therefore have 
quotations which are samplings of the prices in the industry, 
and the statistical methods for determining the reliability of 
samplings apply. The formulas for probable errors given 
in succeeding sections are based upon certain assumptions, 
including that of random sampling, but if 25 or more per cent 
of the possible quotations are utilized, material error in the 
formulas is introduced, the true probable errors being less 
than those given by the formulas. It is to be understood that 
by probable error in an index number is meant that which 
arises from incompleteness of data. In the following determi- 
nations of probable errors of index numbers as given by various 
formulas, the attempt is to see how closely one can approxi- 
mate, by a sample, the number which would be obtained were 
all the possible data utilized in determining the same sort of 
index. The probable error indicates how closely the results 
from the sample may be expected to tally with the results from 
the whole. Should there be a constant tendency in the form 
of index used, systematically leading to too high or too low a 
value, we have a systematic error, which is entirely distinct 
and which is not measured by the size of the probable error.* 

* In the tests of indices suggested in Section 97 there will be found none to the effect that
an index should have no bias. The reason for this is that reversibility of ratio, or change
of base, which is included as one of the tests, is not possible with a “biased” index. Fisher
(1921) shows that an index may possess a bias due to form and a second bias due to base
value weighting, and that these may exactly neutralize each other. Such a situation would,
statistically, be the same as one not involving bias.

The reason why a few quotations can yield an index which is
a close approximation to a general tendency is that there is a
high correlation between the quotations included and those
not included in the index but pertinent to the function being
measured. If there are two hundred coal mines and quotations
from a half dozen are taken, an index in close agreement
with the true index based upon the two hundred may be
expected, because of the high correlation between quotations at
different mines. To say that there is a high correlation is
not equivalent to saying that the prices at the different mines
tend to approach the same level, but that they tend to maintain
a uniform difference. Mine A, near tidewater, may sell
at a certain price, $p^1$, much higher than that, $p^2$, at mine B,
remote from a center of consumption, without indicating an
economically abnormal condition in the coal trade. If $p^1$, $p^2$,
and other similar measures are averaged, the probable error
of this average is not given by the usual formula


$$\text{P.E.}_{\text{mean}} = .6745\,\frac{\sigma}{\sqrt{N}}$$


due to the heterogeneity of, and to the correlation between, 
the p’s. As an illustration, more extreme than mine quota- 
tions on coal, let us average the following prices: 


Bacon per pound . . . . . . . . . . .    .70
Bread per pound . . . . . . . . . . .    .10
Potatoes per bushel . . . . . . . . .   1.20
Apples per box  . . . . . . . . . . .  10.00

Average . . . . . . . . . . . . . . .   3.00

Standard deviation  . . . . . . . . .   4.06
P. E. (by above formula)  . . . . . .   1.37


Now, presumably, the probable error of no single one of these 
quotations is as great as $1.37, and the average of them all 
will probably fluctuate but little. There probably is positive 
correlation between these food prices, a rise in one generally 
going with a rise in each of the others. These conditions are 
not those under which the probable error of an average is given 
by the usual formula. For statistical purposes there is much 
to be gained by having homogeneous uncorrelated material. 
We can secure measures which are nearly, if not entirely, 
homogeneous and uncorrelated by dealing with price ratios 
instead of prices.* 
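The figures of the illustration can be reproduced directly. The sketch below, in Python, assumes the four quotations to be .70, .10, 1.20, and 10.00 (the values consistent with the stated average of 3.00) and takes the standard deviation with divisor N, which is the convention that yields the 4.06 of the text; the variable names are illustrative.

```python
from math import sqrt

# The four quotations of the illustration (dollars per pound, bushel, or box).
prices = [0.70, 0.10, 1.20, 10.00]

n = len(prices)
mean = sum(prices) / n
# Standard deviation about the mean, divisor N (not N - 1).
sd = sqrt(sum((p - mean) ** 2 for p in prices) / n)
# The usual probable-error formula for a mean: .6745 * sigma / sqrt(N).
pe = 0.6745 * sd / sqrt(n)

print(round(mean, 2), round(sd, 2), round(pe, 2))  # 3.0 4.06 1.37
```

The arithmetic is sound, but, as the text observes, the conditions of the formula (homogeneous, uncorrelated measures) are not met by such material, so the 1.37 has no useful meaning here.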


* In one sense, both prices and price ratios are very highly correlated, but these
correlations have quite different statistical consequences. As the price of coal at mine A
approaches $p^A_1$, due to correlation the price at mine B approaches what may be a very
different value, $p^B_1$; but as the ratio, $p^A_1/p^A_2$, from the quotations of mine A approaches, as time
changes, the value $\rho$, due to correlation, the ratio of the quotations from mine B may be
expected to tend toward the same value $\rho$. (The rigorous proof of this statement would be
necessary before the present treatment and statement of probable errors can be considered
final. Whatever error is involved is of a conservative nature, as it almost certainly would
tend to make the obtained probable errors too large.) Although correlation between prices
tends to throw ratios together, it tends to keep prices apart. If, therefore, we deal with
ratios, the effect of correlation has already operated upon the measures used, making the
distribution of ratios more homogeneous, and as a consequence making the mean more
reliable. In other words, the standard deviation of the ratios of prices at date 1 to those at
date 2, $\sigma_{12}$, is reduced from what it would be were there no correlation between prices, so
that by this very reduction, the probable error formula when applied to ratios takes account
of the correlation between prices at two different dates. For a rigorous approach to the
question of probable error of a ratio see Pearson (1910 const. and 1911 ops.).




Accordingly, if the price index showing prices in year 1
relative to year 2, called $i_{12}$, is given by the equation

$$i_{12} = \frac{P_1}{P_2} = \frac{\Sigma\,\dfrac{p_1}{p_2}}{N}$$ (Index formula 1) . . . [313]

and if the standard deviation of the price ratios is $\sigma_{12}$, the
probable error of $i_{12}$ is given by

$$\text{P.E.}_{i_{12}} = .6745\,\frac{\sigma_{12}}{\sqrt{N}}$$ (Probable error of index formula 1) . . . [314]


Let us consider another kind of index,

$$i_{12} = \frac{\Sigma p_1}{\Sigma p_2}$$ (Index formula 2) . . . [315]

The complete probable error formula for this kind of index
involves the correlation between the $p$’s. (See Pearson, 1910,
ops.) The index

$$i_{12} = \frac{\Sigma\, w\,\dfrac{p_1}{p_2}}{\Sigma w}$$ (Index formula 3) . . . [316]


will be more reliable than formula 1 if the weights, $w$, used are
exactly or approximately proportionate to the values of the
commodities involved. In general, the greater the price ratio
the less the consumption and vice versa, so that the distribution
of the weighted price ratios will have a smaller variability than
the distribution of price ratios alone. If $w = p_2 q_2$, the value
of the transactions in year 2, the formula becomes

$$i_{12} = \frac{\Sigma p_1 q_2}{\Sigma p_2 q_2}$$ (Index formula 4) . . . [317]
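The four index formulas and the probable error of formula 1 may be illustrated together. The following is a minimal sketch in Python under stated assumptions: the prices and quantities are hypothetical, formula 2 is taken to be the ratio of the sums of prices, and the standard deviation of the ratios is computed with divisor N. None of the variable names come from the text.

```python
from math import sqrt

# Hypothetical prices and quantities for four commodities (illustrative only).
p1 = [1.10, 2.40, 0.55, 9.00]    # prices in year 1
p2 = [1.00, 2.00, 0.50, 10.00]   # prices in year 2 (the base)
q2 = [50.0, 20.0, 80.0, 3.0]     # quantities in year 2

ratios = [a / b for a, b in zip(p1, p2)]
n = len(ratios)

# Index formula 1: arithmetic mean of the price ratios.
index_1 = sum(ratios) / n

# Index formula 2 (as assumed here): ratio of the sums of prices.
index_2 = sum(p1) / sum(p2)

# Index formula 3: weighted mean of ratios, here with w = p_2 q_2.
weights = [b * q for b, q in zip(p2, q2)]
index_3 = sum(w * r for w, r in zip(weights, ratios)) / sum(weights)

# Index formula 4: the form formula 3 takes when w = p_2 q_2.
index_4 = sum(a * q for a, q in zip(p1, q2)) / sum(b * q for b, q in zip(p2, q2))

# Probable error of formula 1: .6745 * sigma / sqrt(N), sigma being the
# standard deviation of the price ratios (divisor N).
sd_ratios = sqrt(sum((r - index_1) ** 2 for r in ratios) / n)
pe_index_1 = 0.6745 * sd_ratios / sqrt(n)

print(index_1, index_2, index_3, index_4, pe_index_1)
```

With the weights chosen as the year-2 values, formulas 3 and 4 agree to within rounding, which is the sense in which formula 4 is but a type of formula 3.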


Formula 4 is but a type of formula 3. It is undoubtedly more 
reliable than either 1 or 2, but there are too many variables 
involved for the writer to attempt a calculation of its probable 
error based upon the data for two dates only. If, however, 
the commodities are divided into random halves and indexes 
determined from each half, the correlation between these sub- 
indexes may be calculated, and from it the probable error of 
the total index may be obtained, as follows: 

Let there be $n$ commodities, equally excellent as representative
of the whole field, which are built up into the index $i$. In
order to determine the probable error of $i$ we may first build
up two indexes, A and B, each based upon a random half of
the commodities. Calculation of A and B for a number of 
dates will give two series, the correlation between which may 
be found. In doing this it is desirable that the time interval 
between successive indexes be sufficient to insure the relative 
independence of the commodity quotations involved. Just 
as the average of the prices of bread on January 1 of a certain 
year and on December 30 of the same year will in general give 
a truer average yearly price than the average of the prices on 
June 30 and July 1, because in the former case the two quota- 
tions are nearly independent while in the latter one has prac- 
tically but a single quotation, so sub-indexes calculated at 
too short intervals of time scarcely constitute new data, but 
rather repetitions of old data. Were the correlation between 
successive quotations known, practical limits could be set 
giving periods shorter than which it would not be worth while 
to calculate sub-indexes. Having $\sigma$, the standard deviation
of these sub-indexes, and having $r$, the correlation of the
sub-indexes, we may determine the standard error of the average
of the two sub-indexes, i.e., of the total index, $i$. As given by
Kelley (1921, cert.), it is

$$\sigma_i = \sigma\sqrt{\frac{1 - r}{2}}$$ (Standard error of an index in terms of the standard deviation and correlation of sub-indexes) . . . [318]

Note that $r$ and $\sigma$ must be obtained from the same series of
sub-indexes.
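Formula [318] lends itself to a short computation. The sketch below, in Python, assumes two hypothetical series of sub-indexes, A and B, and takes for $\sigma$ the root mean of the two sub-index variances; that pooling, the data, and the function names are assumptions made for illustration, since the text speaks only of "the standard deviation of these sub-indexes."

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def sd(values):
    """Standard deviation about the mean, divisor N."""
    m = mean(values)
    return sqrt(sum((v - m) ** 2 for v in values) / len(values))


def correlation(x, y):
    """Product-moment correlation between two equally long series."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (sd(x) * sd(y))


def standard_error_of_index(sub_a, sub_b):
    """sigma * sqrt((1 - r) / 2), r and sigma from the same two series."""
    r = correlation(sub_a, sub_b)
    # Assumption: sigma is the root mean of the two sub-index variances.
    sigma = sqrt((sd(sub_a) ** 2 + sd(sub_b) ** 2) / 2)
    # max() guards against a tiny negative value from rounding when r is 1.
    return sigma * sqrt(max(0.0, 1.0 - r) / 2)


# Hypothetical sub-indexes from random halves of the commodities, eight dates.
sub_a = [100.0, 104.0, 109.0, 107.0, 112.0, 118.0, 121.0, 117.0]
sub_b = [101.0, 103.0, 110.0, 106.0, 114.0, 117.0, 120.0, 119.0]

se = standard_error_of_index(sub_a, sub_b)
pe = 0.6745 * se  # probable error of the total index
print(se, pe)
```

The higher the correlation between the two halves, the smaller the standard error; identical halves would give zero.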

The practical advantages of reporting two sub-indexes as 
well as the total index may well be as great as has been found 
to be the case in reporting two comparable measures in the 
fields of psychology and education. The probable error of 
any index may be determined if comparable sub-indexes are 
calculated and if the series of indexes covers a sufficient length 
of time to yield a reliable measure of correlation between 
sub-indexes. Probably 16 pairs of quarterly sub-indexes would 
suffice. Since a means of determining the standard error of 
any index is available, we may say that a second important
measure of the excellence of an index number is the size of its 
probable error.* 

Space will not permit a discussion of the probable errors of 
all the proposed types of indexes, but to point out the necessity 
of such discussions the writer has made an estimate, after more
or less complete mathematical analysis, of the relative size of the
probable errors of the index numbers given in Table LXXIV,
Section 97.

The one that seems the most reliable of all, and that also 
most completely meets other conditions except that of parallel- 
ing general thinking tendencies, is the weighted geometric 
mean index, in which the weights are roughly proportional to 
the reliabilities of the price ratios. This requirement as to 
weights is practically no limitation at all, as it is regularly 
approximated to by customary weighting devices. Practi- 
cally without exception the observations of Mitchell (1915) 
as to what items to include in an index and what weights to 
give, are statistically equivalent to weighting price ratios 
according to reliability. 


Section 96. THE ACCURACY AND FLEXIBILITY OF THE
WEIGHTED GEOMETRIC MEAN INDEX


The weights of the commodities involved in an index may 
be changed with much greater facility in the case of some 
indexes than of others. As soon as a commodity becomes 
archaic the proper thing to do is to withdraw it, and with- 
drawals and entrances are readily accomplished with the 
geometric index. The weighted geometric mean index formula is

$$i = \sqrt[\Sigma w]{\frac{(p^1_1)^{w_1}\,(p^2_1)^{w_2}\cdots(p^n_1)^{w_n}}{(p^1_2)^{w_1}\,(p^2_2)^{w_2}\cdots(p^n_2)^{w_n}}}$$ (Index formula 5) . . . [319]

* I judge from the limited abstract of his study that Fisher (1921) has calculated a large
number of different indices from the same material and found that certain formulas give
highly comparable results. The uniformity of indices involving the same data is not the
problem of reliability here attacked. We are concerned with the problem of sampling.
As to whether Professor Fisher has also compared an index determined from a part of his
data with the same index as obtained from a larger part I cannot determine from the
abstract, but if so it constitutes an experimental approach to the problem in hand. One
would expect that the differences which Professor Fisher would find between an index
based upon, let us say, one portion of his data and one based upon the remaining portion
would be somewhat larger than implied by the formula here given, as the index based upon
the remaining portion would be a fallible standard. A study of the uniformity of indices
based upon the same data throws light upon the existence and the nature of systematic
tendencies, or biases, but none whatever upon the error of sampling.


For convenience, and without any loss of generality, $\Sigma w$ may
be made to equal 1. Thus, letting $w_1$ stand for $w_1/\Sigma w$, $w_2$ for $w_2/\Sigma w$,
etc., and letting $\rho_1 = p^1_1/p^1_2$, $\rho_2 = p^2_1/p^2_2$, etc.,

$$i = \rho_1^{w_1}\,\rho_2^{w_2}\cdots\rho_n^{w_n}$$ (Index formula 5 a) . . . [319 a]


Note that with this formula the index is reversible and that 
there is complete freedom in changing the base. Assuming as 
before that there is no correlation between ratios, the probable 
error is given by 


$$\text{P.E.}_i = .6745\; i\,\sqrt{\frac{w_1^2\sigma_1^2}{\rho_1^2} + \frac{w_2^2\sigma_2^2}{\rho_2^2} + \cdots + \frac{w_n^2\sigma_n^2}{\rho_n^2}}$$
(Probable error of the weighted geometric mean index) . . . [320]

in which the $\rho$’s are successive price ratios and the $\sigma$’s their
standard deviations. As an approximation, the $\sigma$’s may be
considered to be equal to each other and to equal the standard
deviation of the distribution of price ratios. In order that this
probable error remain small, it is necessary that no one of the
ratios $w_1/\rho_1$, $w_2/\rho_2$, etc., be exceptionally large.
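The weighted geometric mean index and a probable error of the form of [320] may be computed as follows. This is a minimal sketch in Python, assuming hypothetical price ratios and weights, and adopting the approximation of the text that every $\sigma$ equals the standard deviation of the distribution of price ratios; the function names are illustrative.

```python
from math import prod, sqrt


def weighted_geometric_index(ratios, weights):
    """Product of ratio ** (w / sum of w): index formula 5 a."""
    total = sum(weights)
    return prod(r ** (w / total) for r, w in zip(ratios, weights))


def probable_error(ratios, weights, sigmas):
    """.6745 * i * sqrt(sum of (w * sigma / rho) ** 2), weights summing to 1."""
    total = sum(weights)
    index = weighted_geometric_index(ratios, weights)
    terms = sum(((w / total) * s / r) ** 2
                for r, w, s in zip(ratios, weights, sigmas))
    return 0.6745 * index * sqrt(terms)


# Hypothetical price ratios (year 1 to year 2) and weights, for illustration.
ratios = [1.10, 1.20, 0.95, 1.05]
weights = [4.0, 3.0, 2.0, 1.0]

# Approximation: one common sigma, the standard deviation of the ratios.
n = len(ratios)
mean_ratio = sum(ratios) / n
common_sigma = sqrt(sum((r - mean_ratio) ** 2 for r in ratios) / n)
sigmas = [common_sigma] * n

index_12 = weighted_geometric_index(ratios, weights)
index_21 = weighted_geometric_index([1.0 / r for r in ratios], weights)
pe_12 = probable_error(ratios, weights, sigmas)

print(index_12, index_21, index_12 * index_21, pe_12)
```

The product of the index and its reverse is 1 to within rounding, which exhibits the reversibility claimed for this formula.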

Letting $q^1_1$ equal the quantity of the commodity consumed, or
in trade, it would be expected that $q^1_1 p^1_1$ would fluctuate much
less than $p^1_1$, and whereas there might be danger of $p^1_1$
becoming extremely small or large there is not equal likelihood of
$q^1_1 p^1_1$ doing so. Accordingly, if $w_1$ is approximately $= q^1_1 p^1_1$,
then $w_1/\rho_1 = q^1_1 p^1_2$, a magnitude which is not likely to be
extremely large. However, should a commodity change greatly
in its relative importance, the weighting of it may easily be
changed as follows:

Let it be desired to change the weight of the price ratio $\rho_1$
from $w_1$ to $W_1$, which we will say is a smaller weight. We need
not impose the condition that $\rho_1 = i$. For $\rho_1 > i$ we will
search the list of price ratios for (a) a ratio $> i$ which is
underweighted, or (b) a ratio $< i$ which is overweighted. Suppose
$\rho_2$ is such a ratio. Ordinarily there are a number of price
ratios $= 1.0$, or $i$, or some other value which is the modal
value. These may be combined and represented by $\rho^s$, where
$\rho$ is this modal value and $s$ the sum of the weights of all the
ratios having this value $\rho$. Letting $P$ stand for the product
of all the terms other than $\rho_1$, $\rho_2$, and the $\rho$ terms, we have

$$i = \sqrt[\Sigma w]{\rho_1^{w_1}\,\rho_2^{w_2}\,\rho^{s}\,P}$$

and it is desired to change this to

$$i = \sqrt[\Sigma w]{\rho_1^{W_1}\,\rho_2^{W_2}\,\rho^{S}\,P}.$$

The first index will equal the second in case

(1) $w_1 + w_2 + s = W_1 + W_2 + S$ . . . [321]

and also

(2) $\rho_1^{w_1}\,\rho_2^{w_2}\,\rho^{s} = \rho_1^{W_1}\,\rho_2^{W_2}\,\rho^{S}$ . . . [322]

or, taking logarithms,

$w_1 \log \rho_1 + w_2 \log \rho_2 + s \log \rho = W_1 \log \rho_1 + W_2 \log \rho_2 + S \log \rho$ . . . [323]

$W_1$ is the new weight that has been assigned (this may be
zero) so that everything involved is known except $W_2$ and $S$,
and the solution of the two equations simultaneously will yield
these. Ordinarily $S$ will differ but slightly from $s$, and $W_2$
will differ from $w_2$ in the direction in which it is desirable it
should differ. Thus, as a practical matter, the weight of any
price ratio, whether equal to $i$ or not, may be changed without
affecting the index.
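The simultaneous solution of [321] and [323] is a matter of two linear equations in $W_2$ and $S$. The following sketch in Python assumes hypothetical ratios and weights; the function name `rebalance` and all numerical values are illustrative only. It requires that $\rho_2$ differ from the modal value $\rho$, since otherwise the two equations are not independent.

```python
from math import log, prod


def rebalance(rho1, rho2, rho_mode, w1, w2, s, new_w1):
    """Solve [321] and [323] for the new weights W2 and S.

    Writing d = w1 - W1, the two conditions reduce to
        (W2 - w2) + (S - s) = d
        (W2 - w2) log(rho2) + (S - s) log(rho_mode) = d log(rho1)
    """
    d = w1 - new_w1
    l1, l2, lm = log(rho1), log(rho2), log(rho_mode)
    delta_w2 = d * (l1 - lm) / (l2 - lm)   # requires rho2 != rho_mode
    delta_s = d - delta_w2
    return w2 + delta_w2, s + delta_s


# Hypothetical values: rho1 is to have its weight reduced from .20 to .10.
rho1, rho2, rho_mode = 1.50, 1.30, 1.10
w1, w2, s = 0.20, 0.10, 0.30
new_w1 = 0.10

new_w2, new_s = rebalance(rho1, rho2, rho_mode, w1, w2, s, new_w1)

# The portion of the index affected by the change, before and after.
before = prod([rho1 ** w1, rho2 ** w2, rho_mode ** s])
after = prod([rho1 ** new_w1, rho2 ** new_w2, rho_mode ** new_s])
print(new_w2, new_s, before, after)
```

Since the product of the three affected terms and the sum of their weights are both unchanged, the index as a whole is unchanged, whatever the remaining terms $P$ may be.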

No other index, as far as the writer can determine, offers 
the extreme flexibility in changing weights, dropping or adding 
new items, here found to exist for the geometric mean index. 
Since this is so, the weights can be made such that extreme 
ratios are given small weights or eliminated. As a conse- 
quence, the probable error of such a weighted geometric mean 
index may be expected to be smaller than that of any other 
index mentioned. The excellence of this index seems to the 
writer so great as to warrant its use, even though it involves 
a change in the established habits of interpretation of the 
usual reader. 


Section 97. CRITERIA FOR JUDGING OF THE EXCELLENCE OF 
INDEXES 


Two criteria, the paralleling of habitual modes of thinking 
and reliability, have been proposed in judging the excellence 
of an index measure. Fisher (1913) has used eight other 
tests, three of them being tests only of “trade” indexes. It
would seem that these latter would be of particular impor- 
tance only in case an index ceases to be a sampling and be- 
comes an expression of the sum total of transactions involved. 
Table LXXIV, in part taken from Fisher (1913), gives “scores”
of the most important index measures upon several tests or 
criteria of excellence. 

Test 1: Reliability. In giving scores upon this point the 
writer has freely used his judgment in the case of indexes for 
which no simple probable error formula is available. More 
or less complete statistical analysis has preceded this scoring, 
but it is in no sense to be considered final. An “s-i” after a
score means that no simpler way for calculating the probable
error than by means of the correlation between comparable
sub-indexes seems to be available. As the writer judges this
test to be the most important of all, the scoring is 3, 2, 1, and 0,
instead of 2, 1, and 0; the larger the score, the higher the
rating.

Test 2: Parallels habitual modes of thinking. Score 2, 1, 0.

The following tests are from Fisher. 

Test 3: Proportionality. “A price index should agree with
the price ratios if these all agree with each other.” Stated
algebraically:

Given $\dfrac{p^1_1}{p^1_2} = \dfrac{p^2_1}{p^2_2} = \text{etc.} = i$. Required that $\dfrac{P_1}{P_2} = i$ . . . [324]

Score of 2 if true for any two years. Score of 1 if true only
when year 2 is the base year.

Test 4: Entry and withdrawal. A price index should
permit the entry and withdrawal of price ratios without changing
the value of the index. Fisher uses a less general test: “A
price index should be unaffected by the withdrawal or entry
of a price ratio agreeing with the index.” The scoring here
follows Fisher, except for formula 5, which Fisher does not
include in his list of 44, and for formulas 14 and 15 which are
here scored higher than by Fisher.* Score 3, 2, 1, 0.


* Fisher scores both of these formulas zero on the basis of entrance and withdrawal of
items. However, as shown by Kelley (1921 cert.) a new commodity, whose price ratio
agrees with the index, may be introduced into index formula 15, without changing its value
provided quantities are in the ratio,

$$\frac{q_1}{q_2} = \frac{ab(di - c)}{cd(a - bi)}$$


Test 5: Change of base. “The ratios between price indexes
should be unaffected by reversing or shifting the base.”
Algebraically stated:

Let $i_{12} = \dfrac{P_1}{P_2}$, $i_{13} = \dfrac{P_1}{P_3}$, etc. Required that

$$\frac{i_{34}}{i_{14}} = \frac{i_{32}}{i_{12}} = \frac{P_3}{P_1} = i_{31}$$ . . . [325]

Give score of 2 if true for any two years, score of 1 if only
true when the base year and one other is involved, i.e., if only
such equations as $\dfrac{i_{33}}{i_{13}} = i_{31}$, $\dfrac{i_{22}}{i_{12}} = i_{21}$, etc., hold.


Test 6: Change of unit of measurement. “The ratios
between various price indexes should be unaffected by
changing any unit of measurement.” Score of 2 or 0.

Fisher has a “Determinateness” test which he describes in
the words, “A price index should not be rendered zero, infinity,
or indeterminate by an individual price becoming zero.” This
is but one phase of reliability and is therefore included in 
Test 1 above. 

In the formulas listed the q’s stand for quantities of com- 
modities consumed or in trade and are weights of the p’s. 
When weights not exactly equal to the q’s are involved, the 
symbol w is used. It is of course assumed that care would 
be exercised in selecting these weights. $p_0$ and $q_0$ instead of $p_2$
and $q_2$ are used in those formulas in which the treatment of
the data for the base year is unique. Test 5 is not completely 
met by any such formulas. 


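in which
$a = \Sigma p_1 q_1$
$b = \Sigma p_2 q_1$
$c = \Sigma p_1 q_2$
$d = \Sigma p_2 q_2$

$$i_{12} = \sqrt{\frac{\Sigma p_1 q_1}{\Sigma p_2 q_1}\cdot\frac{\Sigma p_1 q_2}{\Sigma p_2 q_2}}$$ (Index formula 15)

Also, if quantities are in the ratio

$$\frac{q_1}{q_2} = \frac{\Sigma p_2 q_1}{\Sigma p_2 q_2}$$

a commodity whose price ratio is equal to the index may be introduced into index formula 14,

$$i_{12} = \frac{\dfrac{\Sigma p_1 q_1}{\Sigma p_2 q_1} + \dfrac{\Sigma p_1 q_2}{\Sigma p_2 q_2}}{2}$$ (Index formula 14)

without changing its value.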
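The two conditions of this footnote may be verified numerically. The sketch below, in Python, assumes hypothetical prices and quantities for three commodities and takes $a = \Sigma p_1 q_1$, $b = \Sigma p_2 q_1$, $c = \Sigma p_1 q_2$, $d = \Sigma p_2 q_2$; the helper names and every numerical value are illustrative and not from the text.

```python
from math import isclose, sqrt


def sums(p1, p2, q1, q2):
    """Return a, b, c, d as defined in the footnote."""
    a = sum(x * y for x, y in zip(p1, q1))
    b = sum(x * y for x, y in zip(p2, q1))
    c = sum(x * y for x, y in zip(p1, q2))
    d = sum(x * y for x, y in zip(p2, q2))
    return a, b, c, d


def formula_15(p1, p2, q1, q2):
    a, b, c, d = sums(p1, p2, q1, q2)
    return sqrt((a / b) * (c / d))


def formula_14(p1, p2, q1, q2):
    a, b, c, d = sums(p1, p2, q1, q2)
    return (a / b + c / d) / 2


# Hypothetical data (year 1 and year 2 prices and quantities).
p1, p2 = [1.20, 2.50, 0.60], [1.00, 2.00, 0.50]
q1, q2 = [40.0, 25.0, 90.0], [50.0, 20.0, 80.0]
a, b, c, d = sums(p1, p2, q1, q2)

new_p2, new_q2 = 3.0, 10.0  # arbitrary base price and quantity of the entrant

# Formula 15: entrant with price ratio i and q_1/q_2 = ab(di - c) / cd(a - bi).
i15 = formula_15(p1, p2, q1, q2)
ratio15 = a * b * (d * i15 - c) / (c * d * (a - b * i15))
after15 = formula_15(p1 + [i15 * new_p2], p2 + [new_p2],
                     q1 + [ratio15 * new_q2], q2 + [new_q2])

# Formula 14: entrant with price ratio i and q_1/q_2 = b / d.
i14 = formula_14(p1, p2, q1, q2)
after14 = formula_14(p1 + [i14 * new_p2], p2 + [new_p2],
                     q1 + [(b / d) * new_q2], q2 + [new_q2])

print(isclose(i15, after15, rel_tol=1e-9), isclose(i14, after14, rel_tol=1e-9))
```

The condition for formula 15 becomes indeterminate when the two component indexes $a/b$ and $c/d$ are equal, a case the hypothetical data above avoid.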


TABLE LXXIV
Scores of Index Numbers upon Basis of Six Tests of Excellence

Index numbers scored:

 (1) Type IA. $\frac{1}{N}\,\Sigma\frac{p_1}{p_0}$. Carli, Evelyn, Economist, Sauerbeck, Soetbeer.
 (2) Type IA. $\Sigma\frac{p_1}{p_0}w \,/\, \Sigma w$. Young, Falkner, Dun.
 (3) Type IA. $\Sigma\frac{p_1}{p_0}p_1 q_1 \,/\, \Sigma p_1 q_1$. Palgrave.
 (4) Type II. Median value of $p_1/p_0$. Edgeworth.
 (5) Type II. Weighted median.
 (6) Type III, also Type V. Geometric mean. Jevons, Westergaard.
 (7) Type III, also Type V. Weighted geometric mean.
 (8) Type IV. $\Sigma p_1 / \Sigma p_2$. Bradstreet.
 (9) Type IV. $\Sigma p_1 w / \Sigma p_2 w$. Lowe.
(10) Type IV. $\Sigma p_1 (q_0 + q_1) / \Sigma p_0 (q_0 + q_1)$. Edgeworth and Marshall.
(11) Type IV. $\Sigma p_1 \sqrt{q_0 q_1} / \Sigma p_0 \sqrt{q_0 q_1}$. Scrope, Walsh.
(12) Type IV, also Type IA. $\Sigma p_1 q_0 / \Sigma p_0 q_0$. Scrope, Sidgwick, Sauerbeck, Giffen.
(13) Type IV, also Type IH. $\Sigma p_1 q_1 / \Sigma p_0 q_1$. Scrope, Sidgwick, Sauerbeck, Giffen.
(14) Type VI. Arithmetic average of (12) and (13). Sidgwick, Drobisch.
(15) Type VI. Geometric average of (12) and (13).
(16) Type V. $\frac{\Sigma p_1 q_1}{\Sigma q_1} \div \frac{\Sigma p_2 q_2}{\Sigma q_2}$. Drobisch, Rawson-Rawson.
(17) Type V. Nicholson, Walsh.

Tests: 1, Reliability (smallness of P.E.); 2, Parallels habitual mode
of thinking; 3, Proportionality; 4, Entry and withdrawal; 5, Change of
base; 6, Change of unit of measurement.

Formula   Test 1     Test 2   Test 3   Test 4   Test 5   Test 6   Totals
 (1)       .5 s-i    1        2        2        0        2         7.5
 (2)      1.5 s-i    1        2        2        0        2         8.5
 (3)      1   s-i    1        1        1        0        2         6.0
 (4)      2          1        2        1−       1−       2         9.0−
 (5)      3          1        2        1−       1−       2        10.0−
 (6)      1           .5      2        2        2        2         9.5
 (7)      3           .5      2        3        2        2        12.5
 (8)      1.5 s-i    2        2        2        2        0         9.5
 (9)      2.5 s-i    2        2        2        2        2−       12.5−
(10)      2.5 s-i    1.5      1        1        1        2         9.0
(11)      2.5 s-i    1.5      1        1        1        2         9.0
(12)      2   s-i    1        2        2        0        2         9.0
(13)      2   s-i     .5      1        1        0        2         6.5
(14)      2.5 s-i     .5      1        1        0        2         7.0
(15)      2.5 s-i    1.5      1        1        1        2         9.0
(16)      0   s-i     .5      0        0        2        0         2.5
(17)      0   s-i     .5      0        0        2        2         4.5

Type IA: Arithmetic average of ratios
Type IH: Harmonic average of ratios
Type II: Median of ratios
Type III: Geometric average
Type IV: Quotient of aggregates
Type V: Quotients of functions of data of single years
Type VI: Composites of preceding types

Formulas 7 and 9, which are given the highest scores, involve
weights, $w$, instead of quantities, $q$. There is great flexibility
in each of these so that if a weight is adopted, let us say in the
first instance upon the basis of quantities (if using formula 9)
or values (if using formula 7) in trade, which tends to become
unreasonable, it can be changed without affecting the index
between the year when the change is made and the preceding
year. If years from early to late are designated by 1, 2, 3, 4
and if a formula-7 index number is started at the end of the
first year, using weights proportionate to the values of the
commodities in trade, and continues until the beginning of
year 4 before a change in weights is desirable, a change can at
that time be made which will preserve the index $i_{34}$ and its
reciprocal $i_{43}$. The new weighting would probably give an
$i_{14}$ and an $i_{41}$, were they to be calculated, which would be slightly
different from those given by the equations:

$$i_{14} = i_{13}\,i_{34}, \text{ and } i_{41} = \frac{i_{43}}{i_{13}}$$

which would exactly hold had no change in weights been made.
This difference will usually be small, but if an index is demanded
permitting changes in weightings and at the same time enabling
the use, with exactness, of any year as base, it may be made by
the expenditure of a little more labor.


Section 98. THE USE OF ANY YEAR AS BASE


Formula 12 (or 13) in which there are no parameters, or
flexible weightings, will serve as a foundation:

$$i_{12} = \frac{\Sigma p_1 q_2}{\Sigma p_2 q_2}$$

Let $M_1$ = the mean of the $p_1$’s
$m_1$ = the mean of the $q_1$’s
$S_1$ = the standard deviation of the $p_1$’s
$s_1$ = the standard deviation of the $q_1$’s
$r_{11}$ = the correlation between the $p_1$’s (represented by
the first subscript) and the $q_1$’s (represented by
the second subscript).

Symbols with other subscripts have comparable meanings,
e.g., $r_{24}$ = the correlation between the $p_2$’s and the $q_4$’s. Then,

$$\Sigma p_1 q_2 = N\,(M_1 m_2 + r_{12} S_1 s_2)$$
$$\Sigma p_2 q_2 = N\,(M_2 m_2 + r_{22} S_2 s_2)$$

Consequently, the numerator and the denominator for the 
index between any two years may be built up if the means, 
standard deviations, and correlations are known. The data 
required may be calculated each year, as the data for the 


INDEX NUMBERS 347 


year become available, and tabulated in such a table as the 
following: 


Data for Determining Index with Any Desired Year as Base

                                         $r_{pq}$: $p$ FOR YEAR INDICATED IN STUB AND $q$ FOR NUMBER
                                         OF YEARS INDICATED EARLIER (−) OR LATER (+)
Year  No.  M     m         S     s         −32  −16  −8   −4   −2   −1   0    +1   +2   +4   +8   +16  +32
1919   9   ++              ++                                  ++
1918   8
1917   7   *     ++ x *    *     ++ x *                                  *
1916   6
1915   5
1914   4
1913   3
1912   2
1911   1   x               x                                                            x    x


If it is desired to make 1917 the base and to express the
prices in 1919 and 1911 relative to it, then $\Sigma p_9 q_7$ is determined
from the magnitudes recorded in the compartments in which
there is “++”; $\Sigma p_1 q_7$ from the compartments in which there is
“x”; and $\Sigma p_7 q_7$ from the compartments in which there is “*.”

The table as drawn up does not provide space for all the 
possible correlation coefficients. With such care as could be 
taken in choosing the units of quantity, the correlation coeffi- 
cients could be made to vary from year to year in a very regular 
manner, thus enabling interpolation with high accuracy. There 
is complete freedom in changing the weights of commodities, 
but it should be noted that a commodity ‘“‘dropped”’ continues 
as one of zero price and zero quantity — in other words, the N 
has not been decreased by ‘‘dropping” the commodity. To 
change the weight of a commodity price from w to w’ demands 
a warrant. Let us say that such warrant is found in the ratio 
of the quantities consumed. No less warrant is necessary 
when w’ is zero. An article once included in the index should 
come out only in case it becomes practically obsolescent. No 
distortion of any index would result in this case. We may of 
course take out a commodity under other conditions without 
affecting some one particular index. 


APPENDIX A 
LIST OF IMPORTANT SYMBOLS 


When dealing with a single variable:

1. N designates the total population.

2. n is used as an exponent or subscript, or as the population
of a sub-sampling.

3. X designates a gross score, i.e., a score as a deviation from
zero in the quantity scale being considered.

4. x designates a score as a deviation from the mean.

5. $\xi$ designates a score as a deviation from an arbitrary origin.

6. M designates the arithmetic mean.

7. Mdn designates the median (= $P_{.50}$).

8. Mo designates the mode.

9. p designates the proportion of cases lying below the 100 p
percentile, to the left of a dichotomic point in a
frequency polygon.

10. $P_p$ designates the value of the 100 p percentile.

11. q is defined by p + q = 1.

12. U.Q. designates the upper quartile (= $P_{.75}$).

13. L.Q. designates the lower quartile (= $P_{.25}$).

14. Q designates the quartile deviation, or semi-interquartile
range (= [U.Q. − L.Q.]/2).

15. D designates the 10-90 percentile range (= $P_{.90} − P_{.10}$).

16. A.D. designates the average deviation, i.e., the mean
deviation from the mean.

17. $\sigma$ designates the standard deviation from the mean of
scores in a distribution.

18. P.E. designates the probable error (= .6744898$\sigma$).

19. s designates the standard deviation from some point other
than the mean.


350 STATISTICAL METHOD 


20. D designates a summation of scores of the sort indicated. 

21. S designates a summation of summations, or of elements 
other than individual scores. 

22. ¢ with a subscript designates the standard error of the 
constant represented by the subscript. 

23. P.E. with a subscript designates the probable error of the 
constant represented by the subscript. 

24. 1 designates the class interval, or width of base of a given 
class in a frequency polygon. 

25. v designates the value of the lower boundary of a class 
interval. 

26. v’ designates the value of the upper boundary of a class 
interval. 

27. f designates the frequency in a class interval. 

28. F designates the sum of the frequencies below a given 
class interval. 

29. F’ designates the sum of the frequencies above a given 
class interval. 

30. A or 6 designates the difference between the mean and 
arbitrary origin (= M-Arb. orig. = 4). 

31. i, M2,°**Mn designate the moments from the mean (a) 
without application of Sheppard’s corrections if they are 
inconsequential for the problem in hand, or (b) after 
application of Sheppard’s corrections if they are used. 

32. V1, ¥2,°** VY, designate the moments from the mean before 
application of Sheppard’s corrections in problems in 
which Sheppard’s corrections are used. 

33- Mi, Ma,***Mn OF 4, ¥2,***¥, Aesignate moments from an 
arbitrary origin. 
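
The measures of dispersion defined above (Q, D, A.D., σ and P.E.) can be illustrated with a minimal Python sketch. It assumes ungrouped raw scores and finds percentiles by linear interpolation between sorted scores; that interpolation rule and the function names `percentile` and `dispersion_summary` are assumptions of this sketch, not something the list of symbols prescribes.

```python
import math
import statistics


def percentile(scores, p):
    """Score below which a proportion p of the sorted scores lies.

    Uses linear interpolation between neighbouring sorted scores
    (an assumed rule, chosen only for illustration).
    """
    xs = sorted(scores)
    pos = p * (len(xs) - 1)
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def dispersion_summary(scores):
    """Return Q, D, A.D., sigma and P.E. for a list of raw scores."""
    mean = statistics.fmean(scores)
    upper_q = percentile(scores, 0.75)   # U.Q.
    lower_q = percentile(scores, 0.25)   # L.Q.
    sigma = statistics.pstdev(scores)    # standard deviation from the mean
    return {
        "Q": (upper_q - lower_q) / 2,                              # quartile deviation
        "D": percentile(scores, 0.90) - percentile(scores, 0.10),  # 10-90 percentile range
        "AD": sum(abs(x - mean) for x in scores) / len(scores),    # mean deviation from the mean
        "sigma": sigma,
        "PE": 0.6744898 * sigma,                                   # probable error
    }


summary = dispersion_summary(list(range(1, 12)))
```

For the scores 1 through 11 the sketch gives Q = 2.5 and D = 8, and P.E. is simply .6744898 times σ.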


When dealing with the normal distribution: 

A normal distribution in which N = 1 and σ = 1 will be
referred to as a “unit normal distribution.”

In the general normal distribution, x as defined in 4,
σ as defined in 17:

34. y designates the ordinate per unit interval (= zN/σ [as
defined in 36 and 17]).

In the “unit normal distribution”:


35. x designates a deviation from the mean
(= x [as defined in 4]/σ [as defined in 17]).

36. z designates the ordinate
(= yσ/N [as defined in 34 and 17]).
p and q as defined in 9 and 11 (p =f" zdx = [1 —al 
of Sheppard). "ha 


37. Corresponding to a deviation x_1, we have p_1, q_1, z_1; or
corresponding to a proportion p_1, we have q_1, z_1, and x_1.


38. I designates ∫_0^x z dx (= α/2 of Sheppard).
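
The unit normal quantities in items 34 to 38 can be sketched in Python as follows. This is only an illustration: the function names are hypothetical, and taking p as the proportion of the distribution lying below the deviation x is an assumption of the sketch (with q = 1 − p following from the definition of q).

```python
import math


def unit_normal(x):
    """Ordinate and areas of the unit normal distribution at deviation x."""
    z = math.exp(-x * x / 2) / math.sqrt(2 * math.pi)  # ordinate z
    area_0_to_x = 0.5 * math.erf(x / math.sqrt(2))     # I, the area from 0 to x
    p = 0.5 + area_0_to_x                              # assumed: proportion below x
    return {"z": z, "I": area_0_to_x, "p": p, "q": 1 - p}


def general_ordinate(raw_deviation, sigma, n):
    """Ordinate per unit interval, y = zN/sigma, in a general normal distribution."""
    z = unit_normal(raw_deviation / sigma)["z"]
    return z * n / sigma


at_probable_error = unit_normal(0.6744898)
```

At x = .6744898 the area I is one quarter, which is why the probable error is that multiple of σ: half of the cases lie within one probable error of the mean.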


When dealing with unimodal distributions: 
39. y_0 designates the ordinate at the origin (generally at the
mean, the mode or a boundary).

40. m is an exponent. If two exponents are needed, m_1 and
m_2 are used.

41. a in general designates the distance between the origin
and a finite boundary. If two boundaries are finite,
a_1 and a_2 are used.


When dealing with price indexes: 

42. p_s^t designates the price of commodity t at date s.

43. q_s^t designates the amount consumed, or in trade, of com-
modity t at date s.

If few commodities are involved, subscripts are arabic
numbers, and superscripts are primes.

44. p_s designates the price of an unspecified commodity at
date s.

45. q_s designates the quantity consumed, or in trade, of an
unspecified commodity at date s.

46. p_su designates a price ratio or the ratio of the price at
date s to the price at date u (= p_s/p_u).

47. P_s is a composite, weighted or otherwise, of the prices
of several commodities at date s.


48. ts, designates a price index or the ratio of a composite
of prices at date s to prices at date u (= P_s/P_u).

49. w designates the weight given to a price, p, when this
weight differs from q.
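
As an illustration of the price ratio (item 46) and of a price index formed from composites (items 47 and 48), here is a minimal Python sketch. It assumes one particular composite, a weighted sum of prices with fixed weights; the list leaves the form of the composite open ("weighted or otherwise"), so this choice and the function names are assumptions of the sketch.

```python
def price_ratio(p_s, p_u):
    """Ratio of the price of one commodity at date s to its price at date u."""
    return p_s / p_u


def composite(prices, weights):
    """Weighted sum of commodity prices at one date (an assumed form of P)."""
    return sum(w * p for p, w in zip(prices, weights))


def price_index(prices_s, prices_u, weights):
    """Ratio of the composite at date s to the composite at date u."""
    return composite(prices_s, weights) / composite(prices_u, weights)


# Hypothetical prices of three commodities at dates u and s, with
# quantities in trade used as the weights.
prices_u = [1.0, 2.0, 4.0]
prices_s = [1.5, 2.0, 5.0]
quantities = [10, 5, 2]
index = price_index(prices_s, prices_u, quantities)
```

With these hypothetical figures the composites are 28 and 35, so the index is 1.25, while the separate price ratios are 1.5, 1.0 and 1.25.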


When dealing with correlated series: 


50. Symbols as given in 3, 4, 5. Corresponding symbols for
the second series are Y, y, ζ.

51. A second notation utilizes symbols 1, 3, 4, 5, 6, 7, 8, 9,
11, 15, 16, 17, 18 and 19, with subscript 1 added to
represent the first variable, and a subscript 2 added to
represent a second variable.

52. σ_x = σ_1, and σ_y = σ_2.

53. Δ has the meaning as in 30, with reference to the first
variable, and δ this meaning with reference to the
second variable.

54. z_1 designates the first variable expressed as a standard
measure (= x_1/σ_1). z_2 designates the second vari-
able expressed as a standard measure (= x_2/σ_2).
See also 36.

55. r designates the product moment correlation coefficient
between two series.

56. r is also used where specially noted, to designate bi-serial
r, Sheppard’s cos 278 correlation, and occasionally
other specially designated correlation coefficients.

57. ρ designates the correlation coefficient, based upon the
squares of differences in rank.

58. R designates Spearman’s foot-rule correlation coefficient.
See also 86.

59. r_t designates the tetrachoric correlation coefficient.

60. σ_1.2 designates the mean standard deviation of the x-
arrays from the regression line, i.e., it is the standard
error of estimate of variable 1, knowing variable 2.

61. σ_2.1 designates the standard error of estimate of 2, knowing
variable 1.

62. gq designates the mean standard deviation of the x-arrays
from the means of the arrays.


63. x̄ designates the value of x as estimated from a knowledge
of y by means of the regression equation.

64. X̄ designates the value of X as estimated from a knowledge
of Y by means of the regression equation.

65. ȳ and Ȳ have comparable meanings to 63 and 64, inter-
changing the variables.

66. In general, a symbol with a superior bar stands for an
estimated value of a variable, or for an average, but
note 33, 81 and 82.

67. b_12 designates the regression of the x’s upon the y’s, or
the slope of the regression line used in estimating x’s,
knowing y’s.

68. b_21 designates the regression of the y’s upon the x’s.

69. h designates the grouping interval for the first variable
(= i, in 24), and k, the grouping interval for the second
variable.

70. x is the first variate when no grouping is resorted to. It
is not related to χ², of 99.

71. y is the second variate when grouping is not resorted to.
It is not related to y as found in the equations of certain
curves.

72. r, η, and C with a subscript preceding, such as subscript
mtr i etc., designate a coefficient, after some correction
has been made.

73. r with ∞ as one of the subscripts designates a correlation
with a true score, i.e., a correlation corrected for attenu-
ation.

74. r_∞ω designates the correlation between two true scores,
i.e., the correlation corrected for the attenuation in
case of both variables.

75. k designates the coefficient of alienation or the propor-
tionate improvement in estimate, due to the existence
of correlation (= √(1 − r²)). See also 85.

76. p with two subscripts designates a product moment.
Distinguish between this and 9, 89 and 117.

77. d designates a difference between two scores. These
scores may be rank positions.


78. η_xy is the correlation ratio of x upon y, and η_yx that of y
upon x.

79. ζ is the test for linearity (= η² − r²). Distinguish
between this and 50.

80. w represents an arbitrary weight. See also 91.

81. r; designates the average inter-correlation between a
number of independent variables.

82. r. designates the average correlation between a criterion
and a number of variables.

83. 0 used as a subscript designates the criterion.

84. Σ designates the standard deviation of scores in a second
range when the standard deviation in the first range is
σ. Distinguish between this and 20.

85. K designates the alienation coefficient in a second range,
when the alienation coefficient in the first range is k.
Distinguish between 85, 87 and 88.

86. R designates the correlation coefficient in a second range,
when the correlation coefficient in the first range is r.
Note also 58.

87. K? designates the mean of a summation. See formula [205].

88. κ designates the number of categories in a quantitative
or qualitative distribution. λ designates the number
of categories in a second quantitative or qualitative
distribution.

89. p is the greater of two proportions which total 1.0, in a
correlation table. See also 9.

90. α, β, γ, δ are the proportions in the four cells of a four-
fold correlation table.

91. v and w with subscripts designate certain tetrachoric
correlation functions. Distinguish between 25, 26, 80
and 91.

92. φ designates product-moment correlation between two
two-point distributions. This is Pearson’s rp, and
also Yule’s theoretical value of r.

93. φ² designates the mean square contingency. In the case
of a four-fold only, it equals φ of 92 squared.



94. Q designates Yule’s coefficient of association. Distin- 
guish between 14 and 94. 

95. ω designates Yule’s coefficient of colligation.

96. mss designates the theoretical cell frequency. 

97. Mss designates the observed cell frequency. 

98. dss′ designates the cell divergence (= nss′ − mss′).

99. χ² designates the square contingency. See also 70.

100. P designates the probability of a divergence as great or
greater than that obtained, arising as a matter of
chance.


101. og, designates the standard error of the k’th difference
correlation coefficient. 
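
Several of the coefficients named in this group have simple closed forms for a fourfold table. The Python sketch below uses the usual textbook formulas for φ, Yule's Q, Yule's coefficient of colligation and the square contingency; these formulas are not stated in the list itself, and the cell frequencies a, b, c, d (rather than proportions) and the function names are assumptions made for illustration. The coefficient of alienation k = √(1 − r²) is taken directly from the list.

```python
import math


def fourfold_measures(a, b, c, d):
    """Association measures for a fourfold table with cell frequencies
    a, b (first row) and c, d (second row)."""
    n = a + b + c + d
    ad, bc = a * d, b * c
    # Product-moment correlation between two two-point distributions.
    phi = (ad - bc) / math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    # Yule's coefficient of association.
    yule_q = (ad - bc) / (ad + bc)
    # Yule's coefficient of colligation.
    omega = (math.sqrt(ad) - math.sqrt(bc)) / (math.sqrt(ad) + math.sqrt(bc))
    return {
        "phi": phi,
        "phi_squared": phi ** 2,     # mean square contingency of a fourfold table
        "chi_square": n * phi ** 2,  # square contingency
        "Q": yule_q,
        "omega": omega,
    }


def alienation(r):
    """Coefficient of alienation, k = sqrt(1 - r^2)."""
    return math.sqrt(1 - r * r)


measures = fourfold_measures(40, 10, 10, 40)
```

For the hypothetical table 40, 10, 10, 40 the sketch gives φ = 0.6, ω = 0.6 and Q of about 0.88, which shows how Q runs higher than the other two measures for the same table. A correlation of 0.6 corresponds to k = 0.8.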


When dealing with three or more correlated variables: 

102. x_0.12...n designates the residual in the criterion, or error
of estimate of the criterion, after regression equation
estimation of it by means of the other variables
(= x_0 − x̄_0).

103. x̄_0 designates the value of the criterion as estimated from
the other variables.

104. z_0 = x_0/σ_0; z̄_0 = x̄_0/σ_0; z_0.12...n = x_0.12...n/σ_0; etc.

105. r_0.12...n designates the multiple correlation coefficient
between the criterion and the regression equation com-
bination of the independent variables.

106. k_0.12...n designates the multiple alienation coefficient be-
tween the criterion and the regression equation com-
bination of the independent variables.

107. σ_0.12...n designates the standard error of estimate of the
criterion, when estimated by means of the regression
equation.

108. r_01.23...n designates the partial correlation coefficient be-
tween the criterion and variable 1, the other variables
being constant.

109. k_01.23...n designates the partial alienation coefficient be-
tween the criterion and variable 1, the other variables
being constant.


110. β_01.23...n designates the partial regression of the criterion
upon variable 1, the other variables being constant,
not allowing for unequal standard deviations of the
variables.

111. b_01.23...n designates the partial regression of the criterion
upon variable 1, the other variables being constant,
taking into account the standard deviations of the
variables (= β_01.23...n σ_0/σ_1).

112. Δ designates the major determinant.

113. Δ_pq designates the determinant obtained by taking out
the p’th row and the q’th column from the major
determinant.

114. c designates the weighted composite of scores, generally
slightly different from the regression equation com-
posite.

115. u designates the one variable in the c composite which is
treated in a unique manner.

116. c − u designates the c composite, after deduction of the
variable treated uniquely.

117. p designates any one of the variables, other than u, in
the c composite.

118. D designates the distance from the stump to the mean of
a complete normal distribution, in case of truncation
(= x0). See also 15.

119. g— designates the standard deviation of a weighted
average.
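
The relation between the two partial regression coefficients, b = βσ_0/σ_1, can be illustrated with a short Python sketch. The function names and the figures are hypothetical; the sketch only shows that estimating the criterion from raw deviations with the b weights agrees with estimating it in standard measures with the β weights and then multiplying by σ_0.

```python
def raw_regression_weight(beta, sigma_criterion, sigma_predictor):
    """Convert a standard-measure weight (beta) into a raw-score weight (b)."""
    return beta * sigma_criterion / sigma_predictor


def estimate_from_raw(betas, sigmas, sigma_0, deviations):
    """Estimated criterion deviation using b weights on raw deviations."""
    bs = [raw_regression_weight(beta, sigma_0, s) for beta, s in zip(betas, sigmas)]
    return sum(b * x for b, x in zip(bs, deviations))


def estimate_from_standard(betas, sigmas, sigma_0, deviations):
    """Same estimate via standard measures z = x/sigma and beta weights."""
    z_estimate = sum(beta * x / s for beta, s, x in zip(betas, sigmas, deviations))
    return z_estimate * sigma_0


# Hypothetical weights, standard deviations and deviations for two predictors.
betas, sigmas, sigma_0, deviations = [0.5, 0.25], [5.0, 2.0], 10.0, [2.0, 4.0]
estimate = estimate_from_raw(betas, sigmas, sigma_0, deviations)
```

With these figures the b weights are 1.0 and 1.25, and both routes give an estimated criterion deviation of 7.0.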


THE GREEK ALPHABET 


Α α Alpha      Ι ι Iota       Ρ ρ Rho
Β β Beta       Κ κ Kappa      Σ σ Sigma
Γ γ Gamma      Λ λ Lambda     Τ τ Tau
Δ δ Delta      Μ μ Mu         Υ υ Upsilon
Ε ε Epsilon    Ν ν Nu         Φ φ Phi
Ζ ζ Zeta       Ξ ξ Xi         Χ χ Chi
Η η Eta        Ο ο Omicron    Ψ ψ Psi
Θ θ Theta      Π π Pi         Ω ω Omega


APPENDIX B 


BIBLIOGRAPHY 


Arranged chronologically under authors 


ANDERSON, VON O.: Nochmals über “The elimination of spurious correla-
tion due to position in time or space”; Biom., Vol. X, 1914.

ANGELL, FRANK: “On judgments of like’; Am. Jour. Psyc., Vol. XVIII, 
1907. 

BARLOW’S Tables of squares, cubes, square-roots, cube-roots, and reciprocals
of all integer numbers up to 10,000; E. and F. N. Spon, Lond. and 
New York. 

BELL, JULIA: “Tables to facilitate calculation of the rhinal indices”;
Biom., Vol. VIII, 1912. 

BERGSTROM, SVERKER: “Sur les moments de la fonction de corrélation 
normale de n variables’; Biom., Vol. XII, 1918. 

BERNOULLI, J.: “Ars conjectandi, opus posthumum: Accedit tractatus 
de seriebus infinitis, et epistola gallicé scripta de ludo pilae reticu- 
laris,”’ 1713. (A German translation in Ostwald’s Klassiker der 
exakten Wissenschaften, Nos. 107, 108.) 

BERTRAND, J. L. F.: Calcul des probabilités. Gauthier-Villars, Paris, 1889. 

BLAKEMAN, JOHN: “On tests for linearity of regression in frequency 
distributions’’; Biom., Vol. IV, 1905. 

BLAKEMAN, J. and PEARSON, KARL: “On the probable error of the coeffi-
cient of mean square contingency”; Biom., Vol. V, 1906.

BOOLE, G.: Laws of Thought, 1854.

BOREL, E.: Éléments de la théorie des probabilités. Hermann, Paris, 1909.

BORING, EDWIN G.: “Mathematical vs. scientific significance”; Psyc.
Bul., Vol. XVI, No. 10, 1919.

——: “The logic of the normal law of error in mental measurement’’; 
Am. Jour. of Psyc., Vol. XXXI, 1920. 

BOWLEY, A. L.: “Relation between the accuracy of an average and that
of its constituent parts”; Jour. Roy. Stat. Soc., Vol. LX, p. 855,
1897.

——: Elements of statistics, third edition; P. S. King, London, 1907.

BRAVAIS, A.: “Sur les probabilités des erreurs de situation d’un point,”
Mémoires ... l’Académie royale des sciences de l’Institut de France;
sciences mathématiques et physiques, Vol. IX, pp. 255-332, 1846.

BRINTON, WILLARD COPE: Graphic Methods for Presenting Facts, N. Y.
Engineering Mag., 1914.

BROWN, WILLIAM: The Essentials of Mental Measurement; Cambridge
University Press, Lond., 1911. 

BROWN, WILLIAM and THOMSON, GODFREY H.: Essentials of Mental
Measurement; Cambridge, 1921. 

BRUNT, DAVID: Combination of Observations; Putnam, 1917.

CARVER, HARRY C.: “Mathematical representation of the frequency
distribution”; Quar. Am. Stat. Assn., Vol. XVII, 1921. 

CAVE, BEATRICE M. and PEARSON, KARL: “Numerical illustrations of the
variate difference correlation method”; Biom., Vol. X, 1914.

CHARLIER, C. V. L.: “Ueber das Fehlergesetz”; Arkiv för Matematik,
Vol. II, Stockholm, 1905. 

——: “Researches into the theory of probability”; Meddelanden från
Lunds Astronomiska Observatorium, Lunds Universitets Årsskrift.
N. F. Afd. 2, Bd. 1, No. 5, 1906.

COTSWORTH, M. B.: The Direct Calculator, Series O. (Product table to
1000 X 1000.) M’Corquodale and Co., London. 

COURNOT, A. A.: Exposition de la théorie des chances et des probabilités,
1843. 

CRELLE, A. L.: Rechentafeln. (Multiplication table giving all products up 
to 1000 X 1000.) G. Reimer, Berlin. 

CZUBER, E.: Wahrscheinlichkeitsrechnung und ihre Anwendung auf Fehler-
ausgleichung, Statistik und Lebensversicherung; Teubner, Leipzig, 
Ed. 1, 1903; Ed. 2, 1908-10; Ed. 3, 1921. 

DAVENPORT, C. B.: Statistical Methods; John Wiley and Sons, New York
and Chapman and Hall, Lond., 1904. 

DAY, EDMUND E.: “Classification of statistical series”; Quar. Am. Stat.
Assn., New Series, No. 128, Vol. XVI, 1919. 

——: “Standardization of the construction of statistical tables”; Quart. 
Am. Stat. Assn., New Series, No. 129, Vol. XVII, March, 1920. 

DE MORGAN, A.: “Treatise on the theory of probabilities” (Encyclopedia
Metropolitana); 1837. 

DICKSON, J. D. HAMILTON: Appendix to (Galton, 1886), Proc. Roy. Soc.,
Vol. XL, p. 63, 1886.

DODD, EDWARD L.: Error-risk of the median compared with that of the
arithmetic mean; Bul. 323, University of Texas, 1914.

DOODSON, ARTHUR T.: “Relation of the mode, median and mean in
frequency curves”; Biom., Vol. XI, 1917.

DUFFELL, J. H.: “Tables of the Γ function”; Biom., Vol. VII, 1909.

EDGEWORTH, F. Y.: “Observations and statistics: An essay on the theory
of errors of observation and the first principles of statistics”; Cam-
bridge Phil. Trans., Vol. XIV, p. 139, 1885.

——: “Problems in probabilities”; Phil. Mag., 5th Series, Vol. XXII,
p. 371; 1886.

——: “The choice of means”; Phil. Mag., 5th Series, Vol. XXIV, p. 268;
1887.


——: Reports of the committee appointed for the purpose of investi-
gating the best methods of ascertaining and measuring variations in
the value of the monetary standard; British Association Reports
(p. 247), 1887; (p. 181), 1888; (p. 133), 1889; and (p. 485), 1890.

——: “On correlated averages”; Phil. Mag., 5th series, Vol. XXXIV,
p. 190; 1892.

——: Article “Index numbers” in Palgrave’s Dictionary of Political Econ-
omy, Vol. II; Macmillan, 1896.

——: “On the representation of statistics by mathematical formulae”;
Jour. Roy. Stat. Soc., Vol. LXI, 1898; Vol. LXII, 1899; and Vol. 
LXIII, 1900. 

——: “The law of error’; Camb. Phil. Trans., Vol. XX, pp. 36-65 and 
113-141; 1904. 

——: “The generalized law of error, or law of great numbers”; Jour. 
Roy. Stat. Soc., Vol. LXIX, p. 497; 1906. 

——: “On the representation of statistical frequency by a curve”; Jour.
Roy. Stat. Soc., Vol. LXX, p. 102; 1907. 

——: “On the probable errors of frequency constants”; Jour. Roy. Stat.
Soc., Vol. LXXI, pp. 381, 499, 651; 1908. Addendum, Vol. LXXII,
p. 81; 1909.

——: “Index numbers,” Dictionary of Political Economy, Vol. II, pp.
384-387, Macmillan, 1910. 

ELDERTON, W. PALIN: “Tables for testing the goodness of fit of theory
to observation”; Biom., Vol. I, 1902.

——: “Interpolation by finite differences’’; Biom., Vol. II, 1902. 

——: “Tables of powers of natural numbers and of the sums of powers of 
the natural numbers from 1-100’’; Biom., Vol. II, 1903. 

——: Notes on statistical processes; “An alternative method of calculat-
ing the rough moments from the actual statistics,” “The application
of certain quadrature formulae,” “Adjustment of moments”; Biom.,
Vol. IV, 1905.

—: Frequency Curves and Correlation; Layton, London, 1906. 

——: “Some notes on interpolation in n-dimension space”; Biom.,
Vol. 6, 1908.

EVERETT, J. D.: “On a new interpolation formula,” Jour. Inst. Actuaries,
Vol. XXXV, 1901.

EVERITT, P. F.: “Tables of the tetrachoric functions for fourfold correla-
tion tables’’; Biom., Vol. VII, 1910. 

—: “Supplementary tables for finding the correlation coefficient from 
tetrachoric groupings’’; Biom., Vol. 8, 1912. 

——: “Quadrature coefficients for Sheppard’s formula (c)’”’; Biom., Vol. 
XII, 1919. 

FECHNER, G. T.: Kollektivmasslehre, herausgegeben von G. F. Lipps;
Engelmann, Leipzig, 1897.

FILON, L. N. G. and PEARSON, KARL: “On the probable errors of fre-
quency constants and on the influence of random selection on varia-
tion and correlation”; Phil. Trans., Vol. 191, pp. 229-311, 1898.

FISHER, ARNE: Mathematical Theory of Probabilities; Macmillan, 1915.
Second Edition, enlarged, Macmillan, 1922. 

FISHER, IRVING: Purchasing Power of Money; Rev. ed., Macmillan,
1913. 

—: “Best form of index number”’; Quar. Am. Stat. Assn., March, 1921. 

FISHER, R. A.: “On an Absolute Criterion for fitting frequency curves”;
Messenger of Math., New Series, Vol. XLI, pp. 155-160, Cambridge, 
1912. 

—: “On the distribution of the standard deviations of small samples’’; 
Biom., Vol. X, 1915. 

——: “Frequency distribution of the values of the correlation coefficient
in samples from an indefinitely large population”; Biom., Vol. X,
1915.

FOUNTAIN, H.: “Memorandum on the construction of index-numbers
of prices”; in the Board of Trade report on wholesale and retail prices
in the United Kingdom, 1903. 

GALTON, FRANCIS: “Family likeness in stature”; Proc. Roy. Soc., Vol.
XL, p. 42; 1886. 

—: “Correlations and their measurement”; Proc. Roy. Soc., Vol. XLV, 
pp. 136-145, 1888. 

——: Natural Inheritance; Macmillan and Co., 1889. 

——: “Grades and deviates”; Biom., Vol. V, 1907. 

GARNETT, J. C. MAXWELL: Education and World Citizenship. (With a
statistical appendix), Cambridge University Press, 1921. 

GAUSS, C. F.: Méthode des moindres carrés: Mémoires sur la combinaison
des observations, traduits par J. Bertrand, 1855. 

GIBSON, WINIFRED: “Tables for facilitating the computation of probable
errors”; Biom., Vol. IV, 1906.

GREENWOOD, M.: “On errors of random sampling in certain cases not 
suitable for the application of a ‘normal’ curve of frequency”; 
Biom., Vol. IX, 1913. 

GROVE, C. C.: “Mathematics and psychology”; Math. Teacher, Vol. IX,
1916. 

HARRIS, J. ARTHUR: “On the calculation of intra-class and inter-class
coefficients of correlation from class moments when the number of 
possible combinations is large’; Biom., Vol. IX, 1913. 

——: “On spurious values of intra-class correlation coefficients arising
from disorderly differentiation within the classes”; Biom., Vol. X,
1914.

——: “A contribution to the problem of homotyposis”; Biom., Vol. XI, 
1916. 

HASKELL, ALLEN C.: How to Make and Use Graphic Charts; Codex Book 
Co., New York, 1919. 

HERON, DAVID: “An abac to determine the probable errors of correla-
tion coefficients”; Biom., Vol. VII, 1910.

——: “On the probable error of a partial correlation coefficient”; Biom.,
Vol. VII, 1910.

——: “The danger of certain formulae suggested as substitutes for the
correlation coefficient”; Biom., Vol. VIII, 1911.

HOLZINGER, KARL J.: Communication “On the assumption that errors
of estimate are equal in narrow and wide ranges”; Jour. of Ed. 
Research, Vol. IV, No. 3, p. 237, 1921. 

HOOKER, R. H.: “The correlation of the weather and the crops”; Jour.
Roy. Stat. Soc., Vol. LXX, 1907. 

HOOKER, R. H. and YULE, G. U.: “Note on estimating the relative
influence of two variables upon a third”; Jour. Roy. Stat. Soc., Vol. 
LXIX, p. 197, 1906. 

HUNTINGTON, Epwarp V.: Handbook of Mathematics for Engineers, New 
York, 1918. 

——: “Mathematics and statistics, with an elementary account of the 
correlation coefficient and the correlation ratio”; Am. Math. Mo., 
Vol. XXVI, pp. 421-435. Dec., 1919. 

ISSERLIS, L.: “On the partial correlation ratio. Part I. Theoretical”;
Biom., Vol. X, 1914. 

—: “On the partial correlation-ratio’”’; Biom., Vol. XI, 1915. 

——: “On certain probable errors and correlation coefficients of multiple 
frequency distributions with skew regression’; Biom., Vol. XI, 
1916. 

—: “On the representation of statistical data’’; Biom., Vol. XI, 1917. 

—: “On a formula for the product-moment coefficient of any order of 
a normal frequency distribution in any number of variables’’; Biom., 
Vol. XII, 1918. 

——: “Formulae for determining the mean values of products of devia-
tions of mixed moment coefficients in two to eight variables in sam-
ples taken from a limited population”; Biom., Vol. XII, 1918.

KAPTEYN, J. C.: Skew frequency-curves in biology and statistics; Groningen,
1903. 

KELLEY, TRUMAN L.: Educational Guidance; Teachers College, Columbia 
University, 1914. 

——: “Comparable measures”’; Jour. Ed. Psyc., 1914. 

——: “A simplified method of using scaled data for purposes of testing”;
Sch. and Soc., Vol. IV, Nos. 79 and 80; 1916.

——: Tables to facilitate the calculation of partial coefficients of cor-
relation and regression equations; Bul. of the Univ. of Texas (out
of print), No. 27, May, 1916.

——: “Individual testing with completion test exercises’; Teachers 
College Record, Vol. XVIII, No. 4, Sept. 1917. 

——: “Measurement of overlapping’; Jour. of Educ. Psyc., Vol. X, 
No. 9, 1919. 

——: “Principles underlying the classification of men’’: Jour. of Applied 
Psyc., Vol. III, March, 1919. 

—: Chart to facilitate the calculation of partial coefficients of correlation 
and regression equations (containing one 7x10 chart); Stanford 
University, 1921. 

—: Alignment chart of correlation functions, 17 X 23, a supplement to 
the preceding, 1921. 

KELLEY, TRUMAN L.: “Certain properties of index numbers”; Quart.
Am. Stat. Assn., Vol. XVII, Sept., 1921.

——: “The reliability of test scores’’; Jour. Ed. Res., May, 1921. 

—: “A new measure of dispersion’’; Quart. Am. Stat. Assn., June, 1921. 

KELLEY, TRUMAN L., and TERMAN, Lewis M.: “Dr. Ruml’s criticism 
of mental test methods”; Jour. Philosophy, Vol. XVIII, Aug., 1921. 

KEYNES, J. M.: A Treatise on Probability; Macmillan, 1921.

KING, WILFORD ISBELL: The Elements of Statistical Methods; Macmillan,
1912. 

KNIBBS, G. H.: “Prices, price indexes and cost of living in Australia,”
Bur. of Census and Stat., Labor and Indust. Br. Report No. 1, 1912.
Also report No. 9, 1918. 

KOREN, JOHN: The History of Statistics; Macmillan, 1918.

LAPLACE, PIERRE SIMON, MARQUIS DE: Essai philosophique sur les proba-
bilités, 1814. 

—: Théorie analytique des probabilités, 2d edition. 1814. 

LEE, ALICE: “Tables of F(r, ν) and H(r, ν) Functions”; British Assn.
Report, 1899. 

——: “Table of the Gaussian ‘tail’ functions; when the ‘tail’ is larger 
than the body”’; Biom., Vol. X, 1914. 

——: “Further supplementary tables for determining high correlations
from tetrachoric groupings”; Biom., Vol. XI, 1917.

LEE, ALICE: See Pearson and Lee (1908). 

LEXIS, WILHELM: Zur Theorie der Massenerscheinungen in der menschlichen
Gesellschaft. Freiburg, 1877. 

——: “Ueber die Theorie der Stabilität statistischer Reihen,” Jahrbücher
f. Nationalökonomie u. Statistik, Vol. XXXII, 1879.

——: Abhandlungen zur Theorie der Bevölkerungs- und Moralstatistik;
Fischer, Jena, 1903. 

McEwen, GEorGE F. and MIcHAEL, ELLis L.: “The functional relation 
of one variable to each of a number of correlated variables determined 
by a method of successive approximations to group averages: A con- 
tribution to statistical methods’; Proc. of the Am. Acad. of Arts and 
Sciences, Vol. LV, No. 2, Dec., 1919. 

MARCH, L.: “Comparaison numérique de courbes statistiques”; Jour. de
la Soc. de Stat. de Paris, Vol. XLVI, 1905. 

MINER, JAMES Burt: “Correlation’’; Psyc. Bul., Vol. XIII, No. 5, 1916; 
Vol. XIV, No. 5, 1917; Vol. XV, No. 4, 1918; Vol. XVI, No. 11, 
1919; Vol. XVII, No. 11, 1920. 

MINER, JOHN RICE: Tables of √(1 − r²) and 1 − r² for use in partial cor-
relation and in trigonometry. Johns Hopkins Press, Baltimore, 1922. 

MITCHELL, WESLEY C.: Business cycles; California University Memoirs,
Vol. III, 1913.

——: Author of part 1, The making and use of index numbers, of the Bulle-
tin of the U. S. Bureau of Labor Statistics, whole number 173; 1915. 

MOORE, CHARLES N.: “On the coefficient of correlation as a measure of
relationship”; Science, Vol. XLII, No. 1086; 1915.

OTIS, ARTHUR S.: “The Reliability of spelling scales, including a ‘devia-
tion formula’ for correlation”; Sch. and Soc., Vol. IV, Nos. 96-99, 
1916. 

——: “A criticism of the Yerkes-Bridges point scale, with alternative
suggestions”; Jour. Ed. Psyc., Vol. VIII, No. 3, March, 1917.

——: “An absolute point scale for the group measurement of intelligence”;
Jour. Ed. Psyc., Vol. IX, Nos. 5 and 6, May and July, 1918.

PAIRMAN, ELEANOR and PEARSON, KARL: ‘On corrections for the moment- 
coefficients of limited range frequency distributions when there are 
finite or infinite ordinates and any slopes at the terminals of the 
range’’; Biom., Vol. XII, 1919. 

PEARL, R.: “Frequency constants of a variable z = f(x_1, x_2)”; Biom.,
Vol. VI, 1909. 

PEARSON, KARL: “Contributions to the mathematical theory of evolu-
tion (on the dissection of asymmetrical frequency-curves)”; Phil.
Trans. Roy. Soc., Series A, Vol. CLXXXV, p. 71; 1894. 

—: “Skew variation in homogeneous material’; Phil. Trans. A., 
Vol. CLXXXVI, pp. 343, et seq., 1895; and a supplement in Phil. 
Trans. A., Vol. CXCVII, pp. 443-459, 1901. 

——: “Regression, heredity and panmixia”; Phil. Trans. A., Vol.
CLXXXVII, pp. 253-318, 1896. 

—: “On a form of spurious correlation which may arise when indices 
are used’’; etc. Proc. Roy. Soc., Vol. LX, pp. 489-498; 1897. 

——: “On the correlation of characters not quantitatively measurable”;
Phil. Trans. A., Vol. CXCV, pp. 1-47; 1900.

——: “On the criterion that a given system of deviations from the prob-
able in the case of a correlated system of variables is such that it can
be reasonably supposed to have arisen from random sampling”; Phil.
Mag., Vol. L, 1900.

——: “On the lines and planes of closest fit to systems of points in space”;
Phil. Mag., 1901. 

——: “On the influence of natural selection on the variability and cor-
relation of organs”; Phil. Trans. Roy. Soc. of London, A., Vol. CC,
pp. 1-66; 1902.

——: “On the mathematical theory of errors of judgment, with special
reference to the personal equation”; Phil. Trans. Roy. Soc. of London,
A., Vol. CXCVIII, pp. 235-299; 1902.

——: “On the systematic fitting of curves to observations and measure- 
ments”’; Biom., Vols. I and II, 1902. 

——: “On a general theory of the method of false position”; Phil. Mag.,
1903. 

—: “On the probable errors of frequency constants’; Biom., Vol. II, 
1903. 

——: “On the theory of contingency and its relation to association and 
normal correlation,” Math. Contrib. to the Theory of Evolution, Bio- 
metric Laboratory Publications, University of London; Cambridge 
University Press, 1904. 

PEARSON, KARL: “On an elementary proof of Sheppard’s formulae for
correcting raw moments and on other allied points”; Biom., Vol. III,
1904. 

——: “On the theory of skew correlation and non-linear regression,”
Math. Contrib. to the Theory of Evolution, Biometric Laboratory 
Publications; University of London, Cambridge University Press, 1905. 

——: “On the curves which are most suitable for describing the fre-
quency of random samples of a population”; Biom., Vol. V, 1906.

——: “On certain points connected with scale order in the case of cor-
relation of two characters, which for some arrangement give a linear
regression line”; Biom., Vol. V, 1906.

——: “On the significant or non-significant character of a sub-sample
drawn from a sample”; Biom., Vol. V, 1906.

——: “Skew frequency curves’’; Biom., Vol. V, 1906. 

——: “On further methods of measuring correlation,” Math. Contrib.
to the Theory of Evolution; Biometric Laboratory Publications,
University of London; Cambridge University Press, 1907.

—: “Reply to certain criticisms of Mr. G. U. Yule”; Biom., Vol. V, 
1907. 

——: “On the influence of double selection on the variation and correla- 
tion of two characters’”’; Biom., Vol. VI, 1908. 

——: “On a formula for determining Γ(x + 1)”; Biom., Vol. VI, 1908.

——: “On a new method of determining correlation between a measured
character A and a character B, of which only the percentage of cases
wherein B exceeds or falls short of a given intensity is recorded for
each grade of A”; Biom., Vol. VII, 1909.

——: “On the constants of index-distributions as deduced from the like
constants for the components of the ratio, with special reference to
the opsonic index”; Biom., Vol. VII, 1910.

——: “On a new method of determining correlation, when one variable
is given by alternative and the other by multiple categories”; Biom.,
Vol. VII, 1910.

——: “On a correction needful in the case of the correlation ratio’’; 
Biom., Vol. VIII, 1911. 

——: “Further remarks on the law of ancestral heredity”; Biom., Vol.
VIII, 1911.

——: The opsonic index. “Mathematical error and functional error.” 
Biom., Vol. VIII, 1911. 

——: “On the probability that two independent distributions of fre-
quency are really samples from the same population”; Biom., Vol.
VIII, 1911.

——: “On the general theory of the influence of selection on correlation
and variation”; Biom., Vol. VIII, 1912.

——: “On a novel method of regarding the association of two variates
classed solely in alternative categories,” Math. Contrib. to the Theory
of Evolution; Biometric Laboratory Publications, University of London,
Cambridge University Press, 1912.

PEARSON, KARL: “On the measurement of the influence of ‘Broad
Categories’ on correlation”; Biom., Vol. IX, 1913.

—: “On the probable errors of frequency constants”; Biom., Vol. IX, 
1913. 

——: “On the probable error of a coefficient of correlation as found from
a fourfold table”; Biom., Vol. IX, 1913.

——: “On the surface of constant association Q = 0.6”; Biom., Vol. IX,
1913.

——: “On certain errors with regard to multiple correlation occasionally 
made by those who have not adequately studied the subject”; Biom., 
Vol. X, 1914. 

——: “On an extension of the method of correlation by grades or ranks”;
Biom., Vol. X, 1914. 

—: “On the probability that two independent distributions of frequency 
are really samples of the same population, with special reference to 
recent work on the identity of trypanosome strains’; Biom., Vol. 
X, 1914. 

——: “On certain types of compound frequency distributions in which 
the components can be individually described by binomial series’’; 
Biom., Vol. XI, 1915. 

——: “On the probable error of a coefficient of mean square contingency”’; 
Biom., Vol. X, 1915. 

——: “On the application of ‘Goodness of Fit’ tables to test regression 
curves and theoretical curves used to describe observational or 
experimental data”; Biom., Vol. XI, 1916. 

——: “On the general theory of multiple contingency with special refer-
ence to partial contingency”; Biom., Vol. XI, 1916.

——: “On some novel properties of partial and multiple correlation
coefficients in a universe of manifold characteristics”; Biom., Vol.
XI, 1916.

——: “On the probable error of biserial η”; Biom., Vol. XI, 1917.

——: “The probable error of a Mendelian class frequency”; Biom.,
Vol. XI, 1917.

——: “On generalised Tchebycheff Theorems in the mathematical theory
of statistics”; Biom., Vol. XII, 1919.

——: “Notes on the history of correlation”; Biom., Vol. XIII, 1920.

——: “The fundamental problem of practical statistics”; Biom., Vol.
XIII, 1920.

——: “On the probable errors of frequency constants”; Biom., Vol.
XIII, 1920.

——: “On a general method of determining successive terms in a skew
regression line”; Biom., Vol. XIII, 1921.

——: Note on the “Fundamental problem of practical statistics”; Biom.,
Vol. XIII, 1921.

PEARSON, KARL (Editor): Tables for Statisticians and Biometricians,
Cambridge University Press, London, 1914.



PEARSON, KARL and BLAKEMAN, JOHN: “On the mathematical theory of
random migration,” Math. Contrib. to the Theory of Evolution,
Biometric Laboratory Publications, University of London; Cambridge
University Press, 1906.

PEARSON, KARL: See Filon and Pearson. 1898. 

——: See also Blakeman and Pearson, 1906. 

——: See also Cave and Pearson, 1914. 

——: See Pairman and Pearson, 1919. 

PEARSON, KARL and HERON, DAVID: “On theories of association”; Biom.,
Vol. IX, 1913.

PEARSON, KARL and LEE, ALICE: “On the generalized probable error in
multiple normal correlation”; Biom., Vol. VI, 1908.

PEARSON and YOUNG, A. W.: “On the product-moments of various
orders of the normal correlation surface of two variates”; Biom.,
Vol. XII, 1918.

PEIRCE, B. O.: A short table of integrals; 2d rev. ed., Ginn, 1910. 

PERRIN, Emity: “On the contingency between occupation in the case of 
fathers and sons”; Biom., Vol. III, 1904. 

PERSONS, WARREN M.: “Construction of a business barometer based 
upon annual data”; Am. Econ. Rev., Vol. 6, 1916. 

——: “On the variate difference-correlation method and curve fitting’; 
Quart. Am. Statis. Assn., Vol. 16, No. 118, 1917. 

PETERS, J.: Neue Rechentafeln für Multiplikation und Division (gives
products up to 100 × 10000); G. Reimer, Berlin.

PINTNER, RUDOLPH: “Comparison of the Ayres and Thorndike hand- 
writing scales’; Jour. Ed. Psych., Vol. V, No. 9, 1914. 

POINCARÉ, H.: Calcul des probabilités; Gauthier-Villars, Paris, 1896.

POISSON, S. D.: “Sur la proportion des naissances des filles et des
garçons”; Mémoires de l'Acad. des Sciences, Vol. IX, p. 239; 1829.

——: Recherches sur la probabilité des jugements, etc. Paris, 1837.

QUETELET, L. A. J.: Lettres sur la théorie des probabilités, appliquée aux
sciences morales et politiques; 1846. (English translation by O. G.
Downes, 1849.)

RHIND, A.: “Tables to facilitate the computation of the probable errors
of the chief constants of skew frequency distributions”; Biom.,
Vol. VII, 1909-10.

RIETZ, H. L.: “On functional relations for which the coefficient of cor-
relation is zero”; Quar. Am. Stat. Assn., 1919.

——: “Urn schemata as a basis for the development of correlation theory”;
Annals of Math., Vol. XXI, 1920.

RITCHIE-SCOTT, A.: “Note on the probable error of the coefficient of cor-
relation in the variate difference correlation method”; Biom., Vol.
XI, 1915.

——: “The correlation coefficient of a polychoric table”; Biom., Vol. XII,
1918.

RUGG, HAROLD ORDWAY: Statistical Methods Applied to Education; Hough-
ton, Mifflin, 1917.




RUML, BEARDSLEY: “Measure of the efficiency of mental tests”; Psy.
Rev., Vol. XXIII, No. 6, 1916.

SECRIST, Horace: An Introduction to Statistical Methods; Macmillan, 1917. 

SHEPPARD, W. F.: “On the application of the theory of error to cases of 
normal distribution and normal correlation”; Phil. Trans. A., Vol. 
CXCII, pp. 101-167, 1898. 

——: “On the calculation of the most probable values of frequency con-
stants for data arranged according to equi-distant divisions of a
scale”; Proc. Lon. Math. Soc., Vol. XXIX, pp. 353-380; 1898.

——: “On the calculation of the double-integral expressing normal cor-
relation”; Cambridge Phil. Trans., Vol. XIX, p. 23, 1900.

——: “New tables of the probability integral”; Biom., Vol. II, 1903.

——: “The calculation of the moments of a frequency-distribution”;
Biom., Vol. V, 1907.

SMITH, KIRSTINE: “On the ‘best’ values of the constants in frequency
distributions”; Biom., Vol. XI, 1916.

——: “On the standard deviations of adjusted and interpolated values
of an observed Polynomial Function and its constants and the guid-
ance they give towards a proper choice of the distribution of observa-
tions”; Biom., Vol. XI, 1917.

Snow, E. C.: “Application of the method of multiple correlation to the 
establishment of post-censual populations’”’; Jour. Roy. Stat. Soc., Vol. 
LXXIV, pp. 575-629, London, 1911. 

—: “Application of the correlation coefficient to Mendelian distribu- 
tions’; Biom., Vol. VIII, 1912. 

SOMMERVILLE, D. M. Y.: “On the classification of frequency ratios”;
Biom., Vol. V, 1906.

SOPER, H. E.: “On the probable error of the correlation coefficient to a second
approximation”; Biom., Vol. IX, 1913.

——: “On the probable error of the bi-serial expression for the correlation
coefficient”; Biom., Vol. X, 1914.

——: “Tables of Poisson’s exponential binomial limit”; Biom., Vol. X,
1914.

SOPER, H. E., YOUNG, A. W., CAVE, B. M., LEE, ALICE and PEARSON,
KARL: “On the distribution of the correlation coefficient in small
samples”; Biom., Vol. XI, 1917.

SPEARMAN, C.: “The proof and measurement of association between two
things”; Amer. Jour. of Psyc., Vol. XV, 1904.

——: “A footrule for measuring correlation”; Brit. Jour. of Psyc., Vol.
II, p. 89; 1906.

——: “Demonstration of formulæ for true measurement of correlation”;
Am. Jour. Psyc., Vol. XVIII, 1907.

——: “Coefficient of correlation calculated from faulty data”; Brit. Jour.
of Psy., Vol. III, 1910.

——: “Correlations of sums and differences”; Brit. Jour. of Psy., Vol.
V, 1913.

STUDENT: “Probable error of a correlation coefficient”; Biom., Vol. VI, 1908.




STUDENT: “Probable error of a mean”; Biom., Vol. VI, 1908.

——: “On the distribution of the means of samples which are not drawn
at random”; Biom., Vol. VII, 1909.

——: “The correction to be made to the correlation ratio for grouping”;
Biom., Vol. IX, 1913.

——: “The elimination of spurious correlation due to position in time or
space”; Biom., Vol. X, 1914.

——: “Tables for estimating the probability that the mean of a unique
sample of observations lies between −∞ and any given distance
of the mean of the population from which the sample is drawn”;
Biom., Vol. XI, 1917.

——: “An explanation of deviations from Poisson’s law in practice’’; 
Biom., Vol. XII, 1919. 

——: “An experimental determination of the probable error of Dr. 
Spearman’s correlation coefficients’; Biom., Vol. XIII, 1921. 

TCHOUPROFF, AL. A.: “On the mathematical expectation of the moments
of frequency distributions”; Biom., Vol. XII, Part One, 1918; Part
Two, 1919; Biom., Vol. XIII; Part Three, 1921.

TERMAN, Lewis M.: “The intelligence quotient of Francis Galton in 
childhood”’; Am. Jour. of Psy., Vol. XXVIII, 1917. 

TERMAN, LEwis M.: See also Kelley and Terman, 1921. 

THIELE, T. N.: Theory of Observations; London, 1903.

THOMSON, GODFREY H.: “The criterion of goodness of fit of psycho-
physical curves”; Biom., Vol. XII, 1919.

——: “A direct deduction of the constant process used in the method of
right and wrong cases”; Psych. Rev., Vol. XXVI, No. 6, 1919.

——: “On the degree of perfection of hierarchical order among correla-
tion coefficients”; Biom., Vol. XII, 1919.

——: See also Brown and Thomson, 1921.

THORNDIKE, EDWARD L.: Empirical Studies in the Theory of Measure-
ment; Archives of Psyc. (New York), 1907.

——: Mental and Social Measurements; Teachers College, Columbia
University, 1913.

THURSTONE, L. L.: “A scoring method for mental tests”; Psy. Bul.,
Vol. XVI, No. 7, 1919.

TODHUNTER, I. A.: History of the Mathematical Theory of Probability from
the Time of Pascal to that of Laplace; Macmillan, 1865.

URBAN, F. M.: “The application of statistical methods to the problems
of psychophysics”; Exper. Studies in Psyc. and Ped., III, Phila-
delphia, 1908.

——: “Die psychophysischen Massmethoden als Grundlagen empirischer
Messungen”; Archiv f. d. ges. Psychol., Vol. XVI, 1909.

——: Die Praxis der Konstanzmethode; Leipzig, 1912.

VENN, J.: The Logic of Chance: an essay on the foundations and province 
of the theory of probability, with especial reference to its logical 
bearings and its application to moral and social science and to 
statistics. Macmillan & Co., London, Third ed.; 1888. 




WALSH, C. M.: Measurement of General Exchange Value. Macmillan,
1901.

——: Problem of Estimation; P. S. King, London, 1921.

West, C. J.: Introduction to Mathematical Statistics; R. G. Adams & Co., 
Columbus, Ohio, 1918. 

WESTERGAARD, H.: Die Grundzüge der Theorie der Statistik; Fischer,
Jena, 1890.

WHIPPLE, GEORGE CHANDLER: Vital Statistics; Wiley, 1919.

WHIPPLE, G. M.: Manual of Mental and Physical Tests; Sec. Ed., Two
Parts; Warwick and York, 1914.

WHITAKER, LUCY: “On Poisson's law of small numbers”; Biom., Vol. X,
1914.

WOODWORTH, ROBERT SESSIONS: “Combining the results of several tests:
a study in statistical method”; Psyc. Rev., Vol. XIX, 1912.

WRIGHT, THOMAS W. and HAYFORD, JOHN F.: Adjustment of Observations
by the Method of Least Squares with Applications to Geodetic Work;
D. Van Nostrand Co., 1906.

YERKES, ROBERT M. (Editor): “Psychological examining in the United
States army”; Nat'l Acad. of Science, Vol. XV, 1921.

YOUNG, ANDREW W.: “Note on the standard deviations of samples of
two or three”; Biom., Vol. XI, 1916.

YOUNG, ANDREW W. and PEARSON, KARL: “On the probable error of a
coefficient of contingency without approximation”; Biom., Vol.
XI, 1916.

YULE, G. U.: “Notes on the history of pauperism”; Jour. of Roy. Stat.
Soc., Vol. LIX, pp. 318-357; 1896.

——: “On the significance of Bravais’ formulæ for regression, etc., in
the case of skew correlation”; Proc. Roy. Soc., Vol. LX, p. 477, 1897.

——: “On the theory of correlation”; Jour. Roy. Stat. Soc., Vol. LX,
1897.

——: “On the association of attributes in statistics”; Phil. Trans. Roy.
Soc., Series A, Vol. CXCIV, p. 257, 1900.

——: “Notes on the theory of association of attributes in statistics”; Biom.,
Vol. II, 1903.

——: “On the theory of correlation for any number of variables treated
by a new system of notation”; Proc. Roy. Soc., Series A, Vol. LXXIX,
p. 182, 1907.

——: “On the interpretation of correlations between indices or ratios”;
Jour. Roy. Stat. Soc., Vol. LXXIII, p. 644, 1910.

——: “On the methods of measuring the association between two attri-
butes”; Jour. Roy. Stat. Soc., Vol. LXXV, 1912.

——: Introduction to the Theory of Statistics; Lippincott, 1912.

YULE, G. U.: See Hooker, R. H. and Yule, G. U., 1906.

ZIMMERMAN, H.: Rechentafel, nebst Sammlung häufig gebrauchter Zahlen-
werthe; W. Ernst and Son, Berlin. Eng. edition, Asher and Co.,
London, 


APPENDIX C 


KELLEY-WOOD TABLE 
OF THE NORMAL PROBABILITY INTEGRAL 
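
Any single row of the table can be checked by direct computation. The sketch below is offered only as an illustration and is not the method by which the table was prepared. It assumes that the columns have the following meaning, which agrees with the printed rows: I is the area between the mean and the deviate x, p = .5 + I is the larger proportion, q = .5 - I is the smaller proportion, and z is the ordinate of the unit normal curve at x. The function name `kelley_wood_row` is hypothetical.

```python
from statistics import NormalDist


def kelley_wood_row(I):
    """Recompute one row of the table from I, the area between the mean and x.

    Assumed column meanings (an interpretation, not stated in the table itself):
    p = .5 + I, q = .5 - I, x = deviate with proportion p below it,
    z = ordinate of the unit normal curve at x.
    """
    unit_normal = NormalDist()      # mean 0, standard deviation 1
    p = 0.5 + I                     # larger proportion
    q = 0.5 - I                     # smaller proportion
    x = unit_normal.inv_cdf(p)      # deviate cutting off the proportion p
    z = unit_normal.pdf(x)          # ordinate at that deviate
    return {
        "I": I, "x": x, "z": z, "q": q,
        "z/q": z / q, "z/p": z / p, "pq": p * q, "p": p,
    }


row = kelley_wood_row(0.120)
for name, value in row.items():
    print(f"{name:>3}  {value:.6f}")
```

Rounded to the number of places printed, the result for I = .120 may be compared with the corresponding row of the table, and any other row may be checked in the same way.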


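.000 .500 .500

I | x | z | q | pq | p
.000 | .000000 | .398942 | .500 | .250000 | .500
.001 | .002507 | .398941 | .499 | .249999 | .501
.002 | .005013 | .398937 | .498 | .249996 | .502
.003 | .007520 | .398931 | .497 | .249991 | .503
.004 | .010027 | .398922 | .496 | .249984 | .504
.005 | .012533 | .398911 | .495 | .249975 | .505
.006 | .015040 | .398897 | .494 | .249964 | .506
.007 | .017547 | .398881 | .493 | .249951 | .507
.008 | .020054 | .398862 | .492 | .249936 | .508
.009 | .022562 | .398841 | .491 | .249919 | .509
.010 | .025069 | .398816 | .490 | .249900 | .510
.011 | .027576 | .398791 | .489 | .249879 | .511
.012 | .030084 | .398762 | .488 | .249856 | .512
.013 | .032592 | .398730 | .487 | .249831 | .513
.014 | .035100 | .398697 | .486 | .249804 | .514
.015 | .037608 | .398660 | .485 | .249775 | .515
.016 | .040117 | .398621 | .484 | .249744 | .516
.017 | .042626 | .398580 | .483 | .249711 | .517
.018 | .045135 | .398536 | .482 | .249676 | .518
.019 | .047644 | .398490 | .481 | .249639 | .519
.020 | .050154 | .398441 | .480 | .249600 | .520
.021 | .052664 | .398389 | .479 | .249559 | .521
.022 | .055174 | .398336 | .478 | .249516 | .522
.023 | .057684 | .398279 | .477 | .249471 | .523
.024 | .060195 | .398220 | .476 | .249424 | .524
.025 | .062707 | .398159 | .475 | .249375 | .525
.026 | .065219 | .398096 | .474 | .249324 | .526
.027 | .067731 | .398028 | .473 | .249271 | .527
.028 | .070243 | .397959 | .472 | .249216 | .528
.029 | .072756 | .397888 | .471 | .249159 | .529
.030 | .075270 | .397814 | .470 | .249100 | .530
.031 | .077784 | .397737 | .469 | .249039 | .531
.032 | .080298 | .397658 | .468 | .248976 | .532
.033 | .082813 | .397577 | .467 | .248911 | .533
.034 | .085329 | .397493 | .466 | .248844 | .534
.035 | .087845 | .397406 | .465 | .248775 | .535
.036 | .090361 | .397317 | .464 | .248704 | .536
.037 | .092878 | .397225 | .463 | .248631 | .537
.038 | .095395 | .397131 | .462 | .248556 | .538
.039 | .097914 | .397034 | .461 | .248479 | .539
.040 | .100434 | .396935 | .460 | .248400 | .540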


.040 .460 .540

I | x | z | q | z/q | z/p | pq | p
.040 | .100434 | .396935 | .460 | .86290 | .73506 | .248400 | .540
.041 | .102953 | .396834 | .459 | .86456 | .73352 | .248319 | .541
.042 | .105474 | .396729 | .458 | .86622 | .73197 | .248236 | .542
.043 | .107995 | .396623 | .457 | .86788 | .73043 | .248151 | .543
.044 | .110516 | .396513 | .456 | .86955 | .72888 | .248064 | .544
.045 | .113039 | .396401 | .455 | .87121 | .72734 | .247975 | .545
.046 | .115562 | .396287 | .454 | .87288 | .72580 | .247884 | .546
.047 | .118085 | .396170 | .453 | .87455 | .72426 | .247791 | .547
.048 | .120610 | .396051 | .452 | .87622 | .72272 | .247696 | .548
.049 | .123135 | .395929 | .451 | .87789 | .72118 | .247599 | .549
.050 | .125661 | .395805 | .450 | .87957 | .71964 | .247500 | .550
.051 | .128188 | .395678 | .449 | .88124 | .71811 | .247399 | .551
.052 | .130716 | .395549 | .448 | .88292 | .71657 | .247296 | .552
.053 | .133245 | .395417 | .447 | .88460 | .71504 | .247191 | .553
.054 | .135774 | .395282 | .446 | .88628 | .71350 | .247084 | .554
.055 | .138304 | .395145 | .445 | .88797 | .71197 | .246975 | .555
.056 | .140835 | .395005 | .444 | .88965 | .71044 | .246864 | .556
.057 | .143367 | .394863 | .443 | .89134 | .70891 | .246751 | .557
.058 | .145900 | .394719 | .442 | .89303 | .70738 | .246636 | .558
.059 | .148434 | .394572 | .441 | .89472 | .70585 | .246519 | .559
.060 | .150969 | .394422 | .440 | .89641 | .70432 | .246400 | .560
.061 | .153505 | .394270 | .439 | .89811 | .70280 | .246279 | .561
.062 | .156042 | .394115 | .438 | .89981 | .70127 | .246156 | .562
.063 | .158580 | .393957 | .437 | .90150 | .69975 | .246031 | .563
.064 | .161119 | .393798 | .436 | .90321 | .69822 | .245904 | .564
.065 | .163658 | .393635 | .435 | .90491 | .69670 | .245775 | .565
.066 | .166199 | .393470 | .434 | .90661 | .69518 | .245644 | .566
.067 | .168741 | .393303 | .433 | .90832 | .69366 | .245511 | .567
.068 | .171285 | .393133 | .432 | .91003 | .69214 | .245376 | .568
.069 | .173829 | .392960 | .431 | .91174 | .69062 | .245239 | .569
.070 | .176374 | .392785 | .430 | .91345 | .68910 | .245100 | .570
.071 | .178921 | .392608 | .429 | .91517 | .68758 | .244959 | .571
.072 | .181468 | .392427 | .428 | .91689 | .68606 | .244816 | .572
.073 | .184017 | .392245 | .427 | .91861 | .68455 | .244671 | .573
.074 | .186567 | .392059 | .426 | .92033 | .68303 | .244524 | .574
.075 | .189118 | .391870 | .425 | .92205 | .68151 | .244375 | .575
.076 | .191671 | .391681 | .424 | .92378 | .68000 | .244224 | .576
.077 | .194225 | .391488 | .423 | .92550 | .67849 | .244071 | .577
.078 | .196780 | .391293 | .422 | .92723 | .67698 | .243916 | .578
.079 | .199336 | .391095 | .421 | .92897 | .67547 | .243759 | .579
.080 | .201893 | .390894 | .420 | .93070 | .67396 | .243600 | .580


.080 .420 .580


z 2/q 
-201893 | .390894 | . OROVY |) -243600 


204452 | .390691 | . -93244 | . -243439 
-207013 | .390485 |] . O84 aly. -243276 
-209574 | .390277 | . -93592 | . -243111 


"202137 || -390066) ||. Ces ||. -242044 
.214702 | .389852 | . -93940 | . .242775 
-217267 | .389636 | . -Q41I5 | . -242604 


.219835 | .389418 | . -94290 | . -242431 
.222403 | .389197 | . 94465 | . -242256 
-224973 | .388973 | . 94641 -242079 


-227545 | .388747 | . 94816 | . .241900 


.230118 | .388518 | . .94992 | . -241719 
.232693 | .388287 | . 95168 | . .241536 
-235269 | .388053 | . 95345 | - -241351 


.237847 | .387816 | . CEker || . .241164 
-240426 | .387577 | . .95698 | . .240975 
.243007 | .387335 | . Oss || .240784 


.245590 | .387091 | . 96052 | . 240591 
.248174 | .386844 | . .96230 | . .240396 
-250760 | .386595 | . .96408 | . .240199 


253347 | .386342 | . .96586 | . .24.0000 


.255936 | .386088 | . .96764 | . -239799 
.258527 | .385831 | .: .96942 | . -239596 
2OLUZON 3055700 || Ove || ¢ -239391 


.263714 | .385308 | . 197/300). .239184 
.266311 | .385043 | . 207479) lee -238975 
-268909 | .384776 | . 97659 | . .238764 


.271508 | .384506 | . LO7S30- .238551 
P27 ATHOMN ne 3SA223) | 2302 -98019 | . .238336 
E27 OFA EA S305 7) |e .98199 | . .238119 


-279319 | .383679 | . -98379 | . -237900 


.281926 | .383399 | . -98560 | . .237679 
.284536 | .383115 | . 98741 | . .237456 
5207147 || 392030 |. .98922 | . .237231 


.289760 | .382541 | . .99104 | . .237004. 
-292375 | .382250 | . FOOZ COE .236775 
-294992 | .381956 | . -99468 | . .236544 
.297611 | .381660 | . -99650 | . .236311 
2300232) ||. Gorgon) |. Soe || 6 .236076 
.302855 | .381060 | . 1.00016 | . .235839 


.305481 | .380755 | . 1.00199 | . .235600 


I | x | z | q | z/q | z/p | pq | p
.120 | .305481 | .380755 | .380 | 1.00199 | .61412 | .235600 | .620
.121 | .308108 | .380449 | .379 | 1.00382 | .61264 | .235359 | .621
.122 | .310738 | .380139 | .378 | 1.00566 | .61116 | .235116 | .622
.123 | .313369 | .379827 | .377 | 1.00750 | .60967 | .234871 | .623
.124 | .316003 | .379513 | .376 | 1.00934 | .60819 | .234624 | .624
.125 | .318639 | .379195 | .375 | 1.01119 | .60671 | .234375 | .625
.126 | .321278 | .378875 | .374 | 1.01303 | .60523 | .234124 | .626
.127 | .323918 | .378553 | .373 | 1.01489 | .60375 | .233871 | .627
.128 | .326561 | .378227 | .372 | 1.01674 | .60227 | .233616 | .628
.129 | .329206 | .377900 | .371 | 1.01860 | .60079 | .233359 | .629
.130 | .331853 | .377569 | .370 | 1.02046 | .59932 | .233100 | .630
.131 | .334503 | .377236 | .369 | 1.02232 | .59784 | .232839 | .631
.132 | .337155 | .376900 | .368 | 1.02418 | .59636 | .232576 | .632
.133 | .339809 | .376562 | .367 | 1.02605 | .59488 | .232311 | .633
.134 | .342466 | .376220 | .366 | 1.02792 | .59341 | .232044 | .634
.135 | .345125 | .375877 | .365 | 1.02980 | .59193 | .231775 | .635
.136 | .347787 | .375530 | .364 | 1.03168 | .59046 | .231504 | .636
.137 | .350451 | .375181 | .363 | 1.03356 | .58898 | .231231 | .637
.138 | .353118 | .374829 | .362 | 1.03544 | .58751 | .230956 | .638
.139 | .355787 | .374475 | .361 | 1.03733 | .58603 | .230679 | .639
.140 | .358459 | .374118 | .360 | 1.03922 | .58456 | .230400 | .640
.141 | .361133 | .373758 | .359 | 1.04111 | .58309 | .230119 | .641
.142 | .363810 | .373395 | .358 | 1.04300 | .58161 | .229836 | .642
.143 | .366489 | .373030 | .357 | 1.04490 | .58014 | .229551 | .643
.144 | .369171 | .372662 | .356 | 1.04680 | .57867 | .229264 | .644
.145 | .371856 | .372292 | .355 | 1.04871 | .57720 | .228975 | .645
.146 | .374544 | .371919 | .354 | 1.05062 | .57573 | .228684 | .646
.147 | .377234 | .371543 | .353 | 1.05253 | .57426 | .228391 | .647
.148 | .379927 | .371164 | .352 | 1.05444 | .57278 | .228096 | .648
.149 | .382622 | .370783 | .351 | 1.05636 | .57131 | .227799 | .649
.150 | .385320 | .370399 | .350 | 1.05828 | .56984 | .227500 | .650
.151 | .388022 | .370012 | .349 | 1.06021 | .56837 | .227199 | .651
.152 | .390726 | .369623 | .348 | 1.06214 | .56691 | .226896 | .652
.153 | .393433 | .369231 | .347 | 1.06407 | .56544 | .226591 | .653
.154 | .396142 | .368836 | .346 | 1.06600 | .56397 | .226284 | .654
.155 | .398855 | .368439 | .345 | 1.06794 | .56250 | .225975 | .655
.156 | .401571 | .368038 | .344 | 1.06988 | .56103 | .225664 | .656
.157 | .404289 | .367635 | .343 | 1.07182 | .55957 | .225351 | .657
.158 | .407011 | .367230 | .342 | 1.07377 | .55810 | .225036 | .658
.159 | .409735 | .366821 | .341 | 1.07572 | .55663 | .224719 | .659
.160 | .412463 | .366410 | .340 | 1.07768 | .55517 | .224400 | .660

.160 .340 .660


.160 .340 .660 


Z 2/q 


-412463 | .366410 |] . 1.07708 |e -224400 


-415194 | .365996 | . 1.07963 | . .224079 
-417928 | .365580 | . 1.08160 | . .223756 
-420665 | .365160 | . 1.08356 | . .223431 


-423405 | .364738 | . .08553 | . .223104 
-426148 | .364314 | . .08750 | . .222775 
-428895 | .363886 | . 1.08948 | . .222444 


-431644 | .363456 | - .09146 | . .222T11 
-434397 | 363023 | - -09344 | . -221776 
-437154 | 362587 | - 1.09543 | - -221439 


.439913 | .362149 | - LOOWA2 anlar -221100 


-442676 | .361707 | .- .09941I | . .220759 
-445443 | .361263 | . ei OA | Tami .220416 
.448212 | .360817 | . SLO3A2 alae .220071 


.450986 | .360367 | . pLO5 42a eee .219724 
453762 | -359915 | - 10743 | . 219375 
456542 | -359459 | - -10944 | . -219024 


-459326 | .359001 | . HedieAKey || .218671 
.462113 | .358541 | . Sees) || .218316 
.464904 | .358077 | . SES 5 Ones -217959 


.467699 | .357611 | .- MIA || 6 .217600 


-479497 | -357142 | - -TI957 | - -217239 
.473299 | .356670 | . gnieniteray |) .216876 
.476104 | .356195 | - mL2204 al. .216511 


478914 | .355718 | . .12569 | . .216144 
-481727 | -355237 | - -I12774 | - -215775 
-484544 | -354754 | - -12979 | - -215404 


.487365 | .354268 | . aUBINaISS || .215031 
.490189 | .353780 | . .13391 ; -214656 
.493018 | .353288 | . BOE || t .214279 


-495850 ! .352793 | - LZ OO4 ume .213900 


-498687 | .352296 | . SKOIAS || .213519 
.501527 | .351796 | .: AZ TOM es .213136 
.504372 | .351293 | - RiAA2 ON .212751 


PSO7220 Wes oO Vo Thee .14636 | . .212364 
-510073 | .350279 | .- -14846 | . .211975 
.512930 | .349767 | . a 5055 aie .211584 


-515792 | .349253 | - .15265 | . .211191 
.518657 | .348736 | .-: PL 47/5 alae .210796 
.521527 | .348216] . .15686 | . -210399 


-524401 | .347693 | - 15898 | . .210000 
.200 .300 .700


.200 .300 .700


z Pq 


.524401 “347693 : : .49670 | .210000 


527279 | .347167 | . . 49525 | -209599 
-530161 | .346638 | . 21 G; .49379 | .209196 
-533048 | .346107 | . : .49233 | .208791 


-535940 | .345572 | - . .49087 | .208384 
-538836 | .345035 | -295 .48941 | -207975 
541737 | -344494 | - . 48795 | -207564 


-544642 | .343951 | . ; .48649 | .207I5I 
547551 | .343405 | - 17604 | .48504 | .206736 
-550466 | .342856 | . : .48358 | .206319 


553385 | -342304 | .- ; .48212 | .205900 


.556308 | .341749 | . -1825 .48066 | .205479 
559237 | -341I9I | . : -47920 | .205056 
-562170 | .340631 | . ; .47774. | .204631 


.565108 | .340067 | . . .47628 | .204204 
-568051 | .339500 | .28 ‘ .47483 | .203775 
-570999 | -338931 | - . 47337 | -203344 
573952 | -338358 | .28: 195 -A7IQI | .202QII 
-576910 | .337783 | .28 .1978 .47045 | .202476 
-579873 | .337205 | .- -20002 | .46899 | .202039 


.582841 | .336623 | . : .46753 | .201600 


.46607 | .201159 
.46461 | .200716 
.46315 | .200271 


585815 | .336039 
588793 | .335452 
591777 | .334861 


-594766 | .334268 
-597760 | .333672 
.600760 | .333073 


-603765 | .332470 3 
.606775 | .331865 45586 | .198016 
609791 | .331257 45440 | .197559 


.612813 | .330646 | . .224€ -45294 | .197100 


NON nN 
~sI~INI 
“NI CO\O 


.46170 | .199824 
-46024 | .199375 
.45878 | .198924 


-45732 | .198471 


bbb 
NNN 
ReNwW UID 


NVvw 
SSN 


615840 | .330031 | . ; ; .45148 | .196639 
.618873 | .329414 | . .22 .45002 | .196176 
SO2TOT2 828703) le ney .44856 | .1957I1I 
.624956 | .328170 | . F -44710 | .195244 
.628006 | .327544 | .265 .23601 44564 | .194775 
.631062 | .326914 | . 23831 | .44418 | .194304 
634124 | .326281 | . 24061 | .44272 | .193831 
-637192 | .325646 | . .24292 | .44125 | .193356 
-640265 | .325007 | . -24524 | .43979 | .192879 


-643345 | 324365 | . -24756 | .43833 | -192400 


.240 .260 .740

I | x | z | q | z/q | z/p | pq | p
.240 | .643345 | .324365 | .260 | 1.24756 | .43833 | .192400 | .740
.241 | .646431 | .323720 | .259 | 1.24988 | .43687 | .191919 | .741
.242 | .649524 | .323072 | .258 | 1.25222 | .43541 | .191436 | .742
.243 | .652622 | .322421 | .257 | 1.25456 | .43394 | .190951 | .743
.244 | .655727 | .321767 | .256 | 1.25690 | .43248 | .190464 | .744
.245 | .658838 | .321110 | .255 | 1.25925 | .43102 | .189975 | .745
.246 | .661955 | .320449 | .254 | 1.26161 | .42956 | .189484 | .746
.247 | .665079 | .319786 | .253 | 1.26398 | .42809 | .188991 | .747
.248 | .668209 | .319119 | .252 | 1.26635 | .42663 | .188496 | .748
.249 | .671346 | .318449 | .251 | 1.26872 | .42517 | .187999 | .749
.250 | .674490 | .317777 | .250 | 1.27111 | .42370 | .187500 | .750
.251 | .677640 | .317101 | .249 | 1.27350 | .42224 | .186999 | .751
.252 | .680797 | .316421 | .248 | 1.27589 | .42077 | .186496 | .752
.253 | .683961 | .315739 | .247 | 1.27830 | .41931 | .185991 | .753
.254 | .687131 | .315053 | .246 | 1.28070 | .41784 | .185484 | .754
.255 | .690309 | .314365 | .245 | 1.28312 | .41638 | .184975 | .755
.256 | .693493 | .313673 | .244 | 1.28554 | .41491 | .184464 | .756
.257 | .696685 | .312978 | .243 | 1.28798 | .41345 | .183951 | .757
.258 | .699884 | .312279 | .242 | 1.29041 | .41198 | .183436 | .758
.259 | .703090 | .311578 | .241 | 1.29285 | .41051 | .182919 | .759
.260 | .706303 | .310873 | .240 | 1.29531 | .40904 | .182400 | .760
.261 | .709523 | .310165 | .239 | 1.29776 | .40758 | .181879 | .761
.262 | .712751 | .309454 | .238 | 1.30023 | .40611 | .181356 | .762
.263 | .715986 | .308740 | .237 | 1.30270 | .40464 | .180831 | .763
.264 | .719229 | .308022 | .236 | 1.30518 | .40317 | .180304 | .764
.265 | .722479 | .307301 | .235 | 1.30766 | .40170 | .179775 | .765
.266 | .725737 | .306577 | .234 | 1.31016 | .40023 | .179244 | .766
.267 | .729003 | .305850 | .233 | 1.31266 | .39876 | .178711 | .767
.268 | .732276 | .305119 | .232 | 1.31517 | .39729 | .178176 | .768
.269 | .735558 | .304385 | .231 | 1.31768 | .39582 | .177639 | .769
.270 | .738847 | .303648 | .230 | 1.32021 | .39435 | .177100 | .770
.271 | .742144 | .302908 | .229 | 1.32274 | .39288 | .176559 | .771
.272 | .745450 | .302164 | .228 | 1.32528 | .39140 | .176016 | .772
.273 | .748763 | .301417 | .227 | 1.32783 | .38993 | .175471 | .773
.274 | .752085 | .300666 | .226 | 1.33038 | .38846 | .174924 | .774
.275 | .755415 | .299913 | .225 | 1.33294 | .38698 | .174375 | .775
.276 | .758754 | .299155 | .224 | 1.33551 | .38551 | .173824 | .776
.277 | .762101 | .298395 | .223 | 1.33809 | .38403 | .173271 | .777
.278 | .765456 | .297631 | .222 | 1.34068 | .38256 | .172716 | .778
.279 | .768820 | .296864 | .221 | 1.34328 | .38108 | .172159 | .779
.280 | .772193 | .296094 | .220 | 1.34588 | .37961 | .171600 | .780


.280 .220 .780
I | x | z | q | z/q | z/p | pq | p
.280 | .772193 | .296094 | .220 | 1.34588 | .37961 | .171600 | .780
.281 | .775575 | .295320 | .219 | 1.34849 | .37813 | .171039 | .781
.282 | .778966 | .294542 | .218 | 1.35111 | .37665 | .170476 | .782
.283 | .782365 | .293762 | .217 | 1.35374 | .37517 | .169911 | .783
.284 | .785774 | .292978 | .216 | 1.35638 | .37370 | .169344 | .784
.285 | .789192 | .292190 | .215 | 1.35902 | .37222 | .168775 | .785
.286 | .792619 | .291399 | .214 | 1.36168 | .37074 | .168204 | .786
.287 | .796055 | .290605 | .213 | 1.36434 | .36926 | .167631 | .787
.288 | .799501 | .289807 | .212 | 1.36701 | .36778 | .167056 | .788
.289 | .802956 | .289006 | .211 | 1.36970 | .36629 | .166479 | .789
.290 | .806421 | .288201 | .210 | 1.37239 | .36481 | .165900 | .790
.291 | .809896 | .287393 | .209 | 1.37509 | .36333 | .165319 | .791
.292 | .813380 | .286582 | .208 | 1.37780 | .36185 | .164736 | .792
.293 | .816875 | .285766 | .207 | 1.38051 | .36036 | .164151 | .793
.294 | .820379 | .284948 | .206 | 1.38324 | .35888 | .163564 | .794
.295 | .823894 | .284126 | .205 | 1.38598 | .35739 | .162975 | .795
.296 | .827418 | .283300 | .204 | 1.38873 | .35590 | .162384 | .796
.297 | .830953 | .282471 | .203 | 1.39148 | .35442 | .161791 | .797
.298 | .834499 | .281638 | .202 | 1.39425 | .35293 | .161196 | .798
.299 | .838055 | .280802 | .201 | 1.39702 | .35144 | .160599 | .799
.300 | .841621 | .279962 | .200 | 1.39981 | .34995 | .160000 | .800
.301 | .845199 | .279118 | .199 | 1.40260 | .34846 | .159399 | .801
.302 | .848787 | .278272 | .198 | 1.40541 | .34697 | .158796 | .802
.303 | .852386 | .277421 | .197 | 1.40823 | .34548 | .158191 | .803
.304 | .855996 | .276567 | .196 | 1.41106 | .34399 | .157584 | .804
.305 | .859617 | .275709 | .195 | 1.41389 | .34250 | .156975 | .805
.306 | .863250 | .274847 | .194 | 1.41674 | .34100 | .156364 | .806
.307 | .866894 | .273982 | .193 | 1.41960 | .33951 | .155751 | .807
.308 | .870550 | .273114 | .192 | 1.42247 | .33801 | .155136 | .808
.309 | .874217 | .272241 | .191 | 1.42535 | .33652 | .154519 | .809
.310 | .877896 | .271365 | .190 | 1.42824 | .33502 | .153900 | .810
.311 | .881587 | .270486 | .189 | 1.43114 | .33352 | .153279 | .811
.312 | .885291 | .269602 | .188 | 1.43405 | .33202 | .152656 | .812
.313 | .889006 | .268715 | .187 | 1.43698 | .33052 | .152031 | .813
.314 | .892733 | .267824 | .186 | 1.43991 | .32902 | .151404 | .814
.315 | .896473 | .266929 | .185 | 1.44286 | .32752 | .150775 | .815
.316 | .900226 | .266031 | .184 | 1.44582 | .32602 | .150144 | .816
.317 | .903991 | .265129 | .183 | 1.44879 | .32452 | .149511 | .817
.318 | .907770 | .264223 | .182 | 1.45177 | .32301 | .148876 | .818
.319 | .911561 | .263313 | .181 | 1.45477 | .32151 | .148239 | .819
.320 | .915365 | .262400 | .180 | 1.45778 | .32000 | .147600 | .820

.320 .180 .820


.320 .180 .820

I | x | z | q | z/q | z/p | pq | p
.320 | .915365 | .262400 | .180 | 1.45778 | .32000 | .147600 | .820
.321 | .919183 | .261483 | .179 | 1.46080 | .31849 | .146959 | .821
.322 | .923014 | .260562 | .178 | 1.46383 | .31699 | .146316 | .822
.323 | .926859 | .259637 | .177 | 1.46688 | .31548 | .145671 | .823
.324 | .930717 | .258708 | .176 | 1.46993 | .31397 | .145024 | .824
.325 | .934589 | .257775 | .175 | 1.47300 | .31245 | .144375 | .825
.326 | .938476 | .256839 | .174 | 1.47609 | .31094 | .143724 | .826
.327 | .942376 | .255898 | .173 | 1.47918 | .30943 | .143071 | .827
.328 | .946291 | .254954 | .172 | 1.48229 | .30792 | .142416 | .828
.329 | .950221 | .254006 | .171 | 1.48541 | .30640 | .141759 | .829
.330 | .954165 | .253054 | .170 | 1.48855 | .30488 | .141100 | .830
.331 | .958125 | .252097 | .169 | 1.49170 | .30337 | .140439 | .831
.332 | .962099 | .251137 | .168 | 1.49486 | .30185 | .139776 | .832
.333 | .966088 | .250173 | .167 | 1.49804 | .30033 | .139111 | .833
.334 | .970093 | .249205 | .166 | 1.50123 | .29881 | .138444 | .834
.335 | .974114 | .248233 | .165 | 1.50444 | .29728 | .137775 | .835
.336 | .978150 | .247257 | .164 | 1.50766 | .29576 | .137104 | .836
.337 | .982203 | .246277 | .163 | 1.51090 | .29424 | .136431 | .837
.338 | .986271 | .245292 | .162 | 1.51415 | .29271 | .135756 | .838
.339 | .990356 | .244304 | .161 | 1.51742 | .29118 | .135079 | .839
.340 | .994458 | .243312 | .160 | 1.52070 | .28966 | .134400 | .840
.341 | .998576 | .242315 | .159 | 1.52399 | .28813 | .133719 | .841
.342 | 1.002712 | .241315 | .158 | 1.52731 | .28660 | .133036 | .842
.343 | 1.006864 | .240310 | .157 | 1.53064 | .28507 | .132351 | .843
.344 | 1.011034 | .239301 | .156 | 1.53398 | .28353 | .131664 | .844
.345 | 1.015222 | .238288 | .155 | 1.53734 | .28200 | .130975 | .845
.346 | 1.019428 | .237270 | .154 | 1.54071 | .28046 | .130284 | .846
.347 | 1.023651 | .236249 | .153 | 1.54411 | .27892 | .129591 | .847
.348 | 1.027893 | .235223 | .152 | 1.54752 | .27739 | .128896 | .848
.349 | 1.032154 | .234193 | .151 | 1.55095 | .27585 | .128199 | .849
.350 | 1.036433 | .233159 | .150 | 1.55439 | .27430 | .127500 | .850
.351 | 1.040732 | .232120 | .149 | 1.55785 | .27276 | .126799 | .851
.352 | 1.045050 | .231077 | .148 | 1.56133 | .27122 | .126096 | .852
.353 | 1.049387 | .230030 | .147 | 1.56483 | .26967 | .125391 | .853
.354 | 1.053744 | .228979 | .146 | 1.56835 | .26813 | .124684 | .854
.355 | 1.058122 | .227923 | .145 | 1.57188 | .26658 | .123975 | .855
.356 | 1.062519 | .226862 | .144 | 1.57543 | .26503 | .123264 | .856
.357 | 1.066938 | .225798 | .143 | 1.57901 | .26347 | .122551 | .857
.358 | 1.071377 | .224729 | .142 | 1.58260 | .26192 | .121836 | .858
.359 | 1.075837 | .223655 | .141 | 1.58621 | .26037 | .121119 | .859
.360 | 1.080319 | .222577 | .140 | 1.58983 | .25881 | .120400 | .860


.360 .140 .860


Z a bq 


1.080319 | .222577 | . ; ‘ .120400 


.084823 | .221494 | . : F .119679 
.089349 | .220407 | . : : .118956 
.093897 | .219315 | . : : .118231 


.098468 | .218219] . .6045 é .117504 
.103063 | .217119 | . : : .116775 
.107680 | .216013 | . 4 22 .116044 


.112321 | .214903 | . F : -II5311 
.116987 | .213789 | .1I : 2 .114576 
.121676 | .212669 | . 62; 242 -113839 


.126391 | .211545 | . ; 4 .113100 


SZC ES La e2cOA TO: |e. : : 5 .112359 
.135896 | .209283 | . 0: : -III1616 
-140688 | .208145 ] . f e -110871 


.145505 | .207001 | . -642 : -IIOI24 
.150349 | .205853 ] . : : .109375 
“155221 ||-204701 | . : : .108624 


FLOOT ZO" 203543)", 65482 : .107871 
.165047 | .202380 | . : 5 : -107116 
.170002 | .201213 | . -66292 | . .106359 


-174987 | .200040] . P ‘ .105600 


-180001 | .198863 | . : Zeer .104839 
.185044 | .197680 | . .67525 LP .104076 
-IQOI18 | .196493 | . : : -103311 
.195223 | .195300 | . ‘ .2209- .102544 
-200359 | .I94102] .1I¢ 68785 | . -101775 
.205527 | .192900 | . (OO2T IL ie -101004 
210727 ae LOLOOT. matic .69638 | . -100231 
.215960 | .190478 | . -70070 |..214° -099456 
1220227 | 180259 |). -70504 | .2 .098679 


-226528 | .188036 | . -JO94I ris -097900 


.231864 | .186806 | . .71382 : .O97119 
237235) | ekoopzei| < sflo2O see -096336 
.242642 | .184332] . e722 hd ollie -095551 


248085 | .183087 | . 272A alee .094764 
-253565 | .181836] . Semi NS -093975 
-259084 | .180579 | . 173034 se .093184 


.264641 | .179318 | .103 -74095 | .19991 .092391 
270237 0780501). -74559 | .19827 | .og1596 
275874 176777 ‘ .75027 .19664 | .090799 


-175498 | . 75498 | .19500 | .ogoo00 


APPENDIX C 383
     | x | z | q | z/q | z/p | pq | p
.400 | 1.281552 | .175498 | .100 | 1.7550 | .19500 | .090000 | .900
.401 | 1.287271 | .174214 | .099 | 1.7597 | .19336 | .089199 | .901
.402 | 1.293032 | .172924 | .098 | 1.7645 | .19171 | .088396 | .902
.403 | 1.298837 | .171628 | .097 | 1.7694 | .19006 | .087591 | .903
.404 | 1.304685 | .170326 | .096 | 1.7742 | .18841 | .086784 | .904
.405 | 1.310579 | .169018 | .095 | 1.7791 | .18676 | .085975 | .905
.406 | 1.316519 | .167705 | .094 | 1.7841 | .18510 | .085164 | .906
.407 | 1.322505 | .166385 | .093 | 1.7891 | .18345 | .084351 | .907
.408 | 1.328539 | .165060 | .092 | 1.7941 | .18178 | .083536 | .908
.409 | 1.334622 | .163728 | .091 | 1.7992 | .18012 | .082719 | .909
.410 | 1.340755 | .162391 | .090 | 1.8043 | .17845 | .081900 | .910
.411 | 1.346939 | .161047 | .089 | 1.8095 | .17678 | .081079 | .911
.412 | 1.353174 | .159697 | .088 | 1.8147 | .17511 | .080256 | .912
.413 | 1.359463 | .158340 | .087 | 1.8200 | .17343 | .079431 | .913
.414 | 1.365806 | .156978 | .086 | 1.8253 | .17175 | .078604 | .914
.415 | 1.372204 | .155609 | .085 | 1.8307 | .17006 | .077775 | .915
.416 | 1.378659 | .154233 | .084 | 1.8361 | .16838 | .076944 | .916
.417 | 1.385172 | .152851 | .083 | 1.8416 | .16669 | .076111 | .917
.418 | 1.391744 | .151463 | .082 | 1.8471 | .16499 | .075276 | .918
.419 | 1.398377 | .150068 | .081 | 1.8527 | .16329 | .074439 | .919
.420 | 1.405072 | .148666 | .080 | 1.8582 | .16159 | .073600 | .920
.421 | 1.411830 | .147258 | .079 | 1.8640 | .15989 | .072759 | .921
.422 | 1.418654 | .145843 | .078 | 1.8698 | .15818 | .071916 | .922
.423 | 1.425544 | .144420 | .077 | 1.8756 | .15647 | .071071 | .923
.424 | 1.432503 | .142991 | .076 | 1.8815 | .15475 | .070224 | .924
.425 | 1.439531 | .141555 | .075 | 1.8874 | .15303 | .069375 | .925
.426 | 1.446632 | .140112 | .074 | 1.8934 | .15131 | .068524 | .926
.427 | 1.453806 | .138662 | .073 | 1.8995 | .14958 | .067671 | .927
.428 | 1.461056 | .137205 | .072 | 1.9056 | .14785 | .066816 | .928
.429 | 1.468384 | .135740 | .071 | 1.9118 | .14611 | .065959 | .929
.430 | 1.475791 | .134268 | .070 | 1.9181 | .14437 | .065100 | .930
.431 | 1.483280 | .132788 | .069 | 1.9245 | .14263 | .064239 | .931
.432 | 1.490853 | .131301 | .068 | 1.9309 | .14088 | .063376 | .932
.433 | 1.498513 | .129807 | .067 | 1.9374 | .13913 | .062511 | .933
.434 | 1.506262 | .128304 | .066 | 1.9440 | .13737 | .061644 | .934
.435 | 1.514102 | .126794 | .065 | 1.9507 | .13561 | .060775 | .935
.436 | 1.522036 | .125276 | .064 | 1.9574 | .13384 | .059904 | .936
.437 | 1.530068 | .123750 | .063 | 1.9643 | .13207 | .059031 | .937
.438 | 1.538199 | .122216 | .062 | 1.9712 | .13029 | .058156 | .938
.439 | 1.546433 | .120674 | .061 | 1.9783 | .12851 | .057279 | .939
.440 | 1.554774 | .119123 | .060 | 1.9854 | .12673 | .056400 | .940


384 STATISTICAL METHOD 
     | x | z | q | z/q | z/p | pq | p
.440 | 1.554774 | .119123 | .060 | … | … | .056400 | .940
.441 | 1.563224 | .117564 | .059 | … | … | .055519 | .941
.442 | 1.571787 | .115996 | .058 | … | … | .054636 | .942
.443 | 1.580467 | .114420 | .057 | … | … | .053751 | .943
.444 | 1.589268 | .112836 | .056 | … | … | .052864 | .944
.445 | 1.598193 | .111242 | .055 | … | … | .051975 | .945
.446 | 1.607248 | .109639 | .054 | … | … | .051084 | .946
.447 | 1.616436 | .108027 | .053 | … | … | .050191 | .947
.448 | 1.625763 | .106406 | .052 | … | … | .049296 | .948
.449 | 1.635234 | .104776 | .051 | … | … | .048399 | .949
.450 | 1.644854 | .103136 | .050 | … | … | .047500 | .950
.451 | 1.654628 | .101486 | .049 | … | … | .046599 | .951
.452 | 1.664563 | .099826 | .048 | … | … | .045696 | .952
.453 | 1.674665 | .098157 | .047 | … | … | .044791 | .953
.454 | 1.684941 | .096477 | .046 | … | … | .043884 | .954
.455 | 1.695398 | .094787 | .045 | … | … | .042975 | .955
.456 | 1.706044 | .093086 | .044 | … | … | .042064 | .956
.457 | 1.716886 | .091375 | .043 | … | … | .041151 | .957
.458 | 1.727934 | .089652 | .042 | … | … | .040236 | .958
.459 | 1.739198 | .087919 | .041 | … | … | .039319 | .959
.460 | 1.750686 | .086174 | .040 | … | … | .038400 | .960
.461 | 1.762410 | .084417 | .039 | … | … | .037479 | .961
.462 | 1.774382 | .082649 | .038 | … | … | .036556 | .962
.463 | 1.786614 | .080868 | .037 | … | … | .035631 | .963
.464 | 1.799118 | .079075 | .036 | … | … | .034704 | .964
.465 | 1.811911 | .077270 | .035 | … | … | .033775 | .965
.466 | 1.825007 | .075452 | .034 | … | … | .032844 | .966
.467 | 1.838424 | .073620 | .033 | … | … | .031911 | .967
.468 | 1.852180 | .071775 | .032 | … | … | .030976 | .968
.469 | 1.866296 | .069915 | .031 | … | … | .030039 | .969
.470 | 1.880794 | .068042 | .030 | … | … | .029100 | .970
.471 | 1.895698 | .066154 | .029 | … | … | .028159 | .971
.472 | 1.911036 | .064250 | .028 | … | … | .027216 | .972
.473 | 1.926837 | .062332 | .027 | … | … | .026271 | .973
.474 | 1.943134 | .060397 | .026 | … | … | .025324 | .974
.475 | 1.959964 | .058445 | .025 | … | … | .024375 | .975
.476 | 1.977368 | .056476 | .024 | … | … | .023424 | .976
.477 | 1.995393 | .054490 | .023 | … | … | .022471 | .977
.478 | 2.014091 | .052485 | .022 | … | … | .021516 | .978
.479 | 2.033520 | .050462 | .021 | … | … | .020559 | .979
.480 | 2.053749 | .048418 | .020 | … | … | .019600 | .980


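APPENDIX C
     | x | z | q | z/q | z/p | pq | p
.480 | 2.053749 | .048418 | .020 | … | … | .019600 | .980
.481 | 2.074855 | .046354 | .019 | … | … | .018639 | .981
.482 | 2.096927 | .044268 | .018 | … | … | .017676 | .982
.483 | 2.120072 | .042160 | .017 | … | … | .016711 | .983
.484 | 2.144411 | .040028 | .016 | … | … | .015744 | .984
.485 | 2.170090 | .037870 | .015 | … | … | .014775 | .985
.486 | 2.197286 | .035687 | .014 | … | … | .013804 | .986
.487 | 2.226211 | .033475 | .013 | … | … | .012831 | .987
.488 | 2.257129 | .031234 | .012 | … | … | .011856 | .988
.489 | 2.290370 | .028960 | .011 | … | … | .010879 | .989
.490 | 2.326348 | .026652 | .010 | … | … | .009900 | .990
.491 | 2.365618 | .024306 | .009 | … | … | .008919 | .991
.492 | 2.408916 | .021920 | .008 | … | … | .007936 | .992
.493 | 2.457264 | .019487 | .007 | … | … | .006951 | .993
.494 | 2.512144 | .017003 | .006 | … | … | .005964 | .994
.495 | 2.575829 | .014460 | .005 | … | … | .004975 | .995
.496 | 2.652070 | .011847 | .004 | … | … | .003984 | .996
.497 | 2.747781 | .009149 | .003 | … | … | .002991 | .997
.498 | 2.878161 | .006340 | .002 | … | … | .001996 | .998
.499 | 3.090229 | .003367 | .001 | … | … | .000999 | .999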
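
The legible rows of the table appear to satisfy simple relations: the last column p is .5 plus the first column, q = 1 − p, x is the unit normal deviate below which the proportion p lies, z is the ordinate at x, and the remaining columns are z/q, z/p and pq. A minimal Python sketch of one row under those assumptions follows. The function name `kelley_wood_row` is hypothetical, and the standard library's `NormalDist` stands in for whatever method the table's authors actually used.

```python
from statistics import NormalDist


def kelley_wood_row(p):
    """Return (p - .5, x, z, q, z/q, z/p, pq, p) for a larger proportion p.

    Assumed column meanings, inferred from the legible rows of the table.
    """
    if not 0.5 <= p < 1.0:
        raise ValueError("p must lie in [0.5, 1)")
    unit = NormalDist()      # unit normal: mean 0, standard deviation 1
    q = 1.0 - p              # smaller proportion
    x = unit.inv_cdf(p)      # deviate below which the proportion p lies
    z = unit.pdf(x)          # ordinate of the unit normal curve at x
    return (p - 0.5, x, z, q, z / q, z / p, p * q, p)


# Print one row in the table's column order.
row = kelley_wood_row(0.850)
print(" | ".join(f"{value:.6f}" for value in row))
```

For p = .850 and p = .900 the computed x, z, z/q, z/p and pq agree with the tabled rows to the printed number of places, which supports the assumed column meanings without proving them.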


INDEX 


Boldface is used for references to definitions. 


Alienation, coefficient of, 173 
see also Correlation 

Alignment chart of correlation functions, 
291-295, inside back end paper 

American Society of Mechanical Engi- 
neers, 42 

Anderson, von, O., 271, 276 

Angell, Frank, 147 

Approximations, errors in, 164-167 

Array, 154, 155 

Attenuation, 204-205 

Average, moving, 28 

Averages, 44-69 


Bar diagram, 38 

Bell, Julia, 265 

Best fit, 159 

Blakeman, John, 239, 269 

Block diagram, 40 

Boas, Franz, 259 

Bowley, A. L., 55 

Bravais, A., 152 

Bridges, Calvin B., 321 

Broad categories, effect of, 167-171 
Brown, Carl, 226, 227 

Brown, William, 37, 47, 190, 203, 205, 326 


Canning, J. W., 183 

Caption, 6 

Categorical measures, graphic representa- 
tion of, 37-43 

Cave, Beatrice M., 271, 272, 273 

Cave, F. E., 270

Central tendencies, 44-69 

Charlier, C. V. L., 123 

Chart of ratios, 22, 23-27 

Chart, relative time, 16-20, 18 

Chart, time, 16, 17 

Charts, summary of rules for construction 
of, 42-43 

Class index, 11-13, 168-169 

Class interval, 11, 13 

Class limits, 11, 12 


Class mean, 168-169 
Cobb, Margaret V., 314 
Comparable measures, 109-122, 153 
percentile method, 118-122 
ratio method, 110-114 
standard measure method, 114-117 
Contingency 
see Correlation 
Correlated measures, functions of, 196-230 
Correlation, average inter-, 217-221 
Correlation between 
a mean and a cell frequency, 178 
a mean and coefficient of correlation, 178
a mean and standard deviation, 178 
any two product moments, 175
coefficients of correlation, 179 
means, 178 
standard deviations, 178 
standard deviation and coefficient of 
correlation, 178 
sums or averages, 196-200 
Correlation coefficient, product-moment, 
161-164 
calculation of, 179-181 
corrections to, 171 
Correlation, corrected for attenuation, 
204-205 
error in, 208-212 
Correlation, effect of range upon, 221-230 
Correlation, interpretation of, 189-190
graphic, 153-156 
Correlation, partial and multiple, 279-310 
multiple alienation coefficient, 288, 
299-300 
multiple correlation coefficient, 287 
multiple, three variables only, 280-295
multiple, n variables, 283, 292, 294,
295-310 
partial alienation coefficient, 289 
partial correlation coefficient, 289, 290, 
298 
by successive approximations, 302-310




Correlation surface, normal, 156, 157-159
Correlation surfaces, 172 
Correlation table, 154 
Correlation, various measures of, 231-278 
bi-serial eta, 249-253 
bi-serial r, 245-249
contingency, 262-265 
contingency, coefficient of, 265-271 
contingency, corrections to coefficient 
of, 267-271 
contingency, partial, 280 
contingency, multiple, 280 
equi-probable r, 265 
four-fold point surface, 259-260 
mean square contingency, 265-271 
non-rectilinear regression, 185-189 
Otis’ rank relation, 234-237 
parabolic regression, 245 
φ, see Correlation, four-fold point surface
rank method, 191-194 
ratio, correlation, 238-245 
ratio, correlation, corrected, 241, 242, 
244 
ratio, correlation, multiple, 280 
ratio, correlation, partial, 280 
Thorndike’s median ratio coefficient, 
231-234 
tetrachoric, 253-258 
variate difference, 271-278, 280 
Yule’s coefficient of association, 260-262
Yule’s coefficient of colligation, 260-262 
Correlation with true measures, 200-201, 
204 
Cross-over value of a chromosome section, 
321-324 
Curve fitting, 123-150 
normal curve, 136 
type II, 136-137 
type III, 137-138 
type V, 138 
type VII, 137 
Curves, types of, 128-135 


Day, Edmund E., 2 

Deviation, mean, 70-75, 96 

Deviation, quartile, 34, 75 

Deviation, 10-90 percentile range, 34, 75-77


Deviation, standard, 77-82 
Deviation, standard, of constants of 
single series 
of a class frequency, 86-92 
of any moment, 84-86 
of index numbers, 334-339, 340 
of an interpercentile range, 76 
of the mean, 82-83, 177 
of the median, 90 
of measure of Kurtosis, 77 
of measure of Skewness, 77 
of a percentile, 86-92 
of 10-90 percentile range, 76 
of the standard deviation, 176 
Deviation, standard, of measures of cor- 
relation 
of bi-serial eta, 250 
of bi-serial r, 249
of coefficient of contingency, 260 
of correlation ratio, 241 
of multiple coefficient of correlation, 
301-302 
of partial coefficient of correlation, 
301 
of product-moment coefficient of cor- 
relation, 176 
of r corrected for attenuation, 209-210 
of r inferred from an r obtained in a
different range, 316
of rank coefficient of correlation, 194 
of regression coefficient, 176, 301, 302 
of tetrachoric coefficient of correlation, 
257-258 
of φ, 262, 260
of variate difference correlation co- 
efficient, 276-277 
Deviation, standard, of a difference, 182 
Deviation, standard, of an array, 155, 173 
of an array mean, 177 
Deviation, standard, of an estimated 
measure, 300 
Deviation, standard, of any product 
moment, 175 
Dickson, J. D. Hamilton, 156 
Dispersion, 44, 70-93 
Duffell, J. H., 137 


Edgeworth, F. Y., 123, 152, 333 
Elderton, W. Palin, 47, 124, 264 
Error, probable, 98 




Error, probable 

see Deviation, standard 
Error, standard 

see Deviation, standard 
Everitt, P. F., 254, 255 


Fechner, G. T., 326, 327 

Filon, L. N. G., 176, 179 

Fisher, Irving, 332, 333, 335, 339, 341,
342

Forsyth evaluation of the Gamma func- 
tion, 136 

Frequency polygon, 9, 10, 11-15 


Galton, Francis, 114, 152, 153, 155 
Gauss, C. F., 153 

Graphic methods, 9-43 

Greek alphabet, 356 

Grouping, 50, 167 

Grouping, effect upon correlation, 167-171 
Grouping, rule for, 52 

Grove, C. C., Preface, p. vi 

Growth curve, 34 

Growth increments, 35-37 


Haskell, Allen C., 42

Heron, David, 259, 261, 262, 266 
Herring, John P., 72 

Histogram, 9, 10, 11-12 
Holzinger, Karl J., 222 
Homoclisy, 172 
Homoscedasticity, 172 

Hooker, R. H., 271 


Indexes, 66-67 

Index numbers, 331-347 
change of base of, 346-347 
flexibility of geometric, 339-341 
meaning of, 333-334 
tests of, 341-346 

Isserlis, L., 179, 188 


Kapteyn, J. C., 123 

Kelley, Lura, 97 

Kelley, Truman L., 75, 90, 97, 115, 173,
213, 221, 223, 291, 297, 298, 321, 333,
338, 342

Kelley-Wood table, 97, 370-385 

Knibbs, G. H., 333 

Kurtosis, 45, 77 




Labelling classes, 53


Lee, Alice, 255, 311 
Lengthening tests, effect of, 205-208 


Map diagram, 39-41 


Mean 
arithmetic, 44, 45-53 
geometric, 65-66 
guessed, 48 
harmonic, 63-64 
Median, 34, 54-57 
Mitchell, Wesley C., 333, 339 
Mode, 34, 60-62 
Moments, 48, 79 
Müller, G. E., 327
Mutilated distributions, constants of,
311-314
Mutilated distributions, correlation in,
314-316


Normal curve, fitting a, 136
Normal distribution, 94-108, 120-130,
145, 149
Normal distribution, unit, 99-100, 350


Ogive, 31-34 

Origin, arbitrary, 48 

Otis, Arthur S., 118, 234, 237 

Overlapping, error in measures of, 213, 
316-319 


Pearson, Karl, Preface, pp. v and vi, 94,
99, 123, 124, 125, 135, 137, 138, 140,
141, 143, 152, 153, 160, 172, 174, 175,
176, 179, 193, 194, 225, 229, 231, 239,
241, 248, 249, 250, 253, 254, 257, 259,
261, 262, 264, 265, 266, 268, 269, 271,
272, 273, 311, 333, 339

Percentiles, 34, 57-59 


Perry, C. A., 39


Persons, Warren M., 271 
Pintner, Rudolph, 115 
Population, 44 


Probability of exceeding a given divergence,
102-103
Probable error, 98 
Probable error 
see Deviation, standard 
Probable error of estimate 
see Deviation, standard, of an array 
Product theorem in correlation, 84 




Product theorem in probabilities, 262 
Psychophysical methods, 326-330 


Ratios, 66-67, 110-114 
Regression, 152, 154 
see also Correlation 
Regression coefficients, 160-161, 181-185 
conjugate, 298 
3 variables, 283, 285-294 
n variables, 283, 292, 294, 295, 296-302 
Regression equation, 161 
3-variables, 281, 283 
n-variables, 283, 295-310 
Relationship, measures of, 151-195
see Correlation 
Reliability coefficient, 200-203 
Residual, definition of, 281, 284 
Reversion, 152, 154 
Rhind, A., 143 
Rich, Willis H., 273 
Richmond, H. A., 185 
Rietz, H. L., Preface, p. vi, 189 
Ritchie-Scott, A., 271 
Rugg, Harold O., 39 


Scatter diagram 
see Correlation table 
Series, statistical, 2-5 
complex, 39-41 
qualitative, 2, 5 
quantitative, 2, 5 
spacial and geographical, 2, 4 
temporal, 2-3 
Sheppard, W. F., 94, 125, 168, 169, 174, 
176, 257 
Similar forms, 201-203 
Skewness, 44, 77 
Smoothing data, 27-31 
Soper, H. E., 177, 248, 249 
Spearman, Charles, 193, 196, 203-204,
205, 210
Stability of distributions, 138-150
Standard error, 83


Standard error of estimate 
see Deviation, standard, of an array 
Standard measures, 115, 280 
Stub, 6 
Student, 243, 271, 272 
Symbols, list of important, 349-356 


Tables, statistical, 5-8 
derived, 7 
general purpose, 7 
primary, 7 
special purpose, 7 
Thiele, T. N., 123 
Thomson, Godfrey H., 37, 47, 190, 203, 
326, 327 
Thorndike, E. L., Preface, p. vi, 231, 234 
Thurstone, L. L., 199 
Trade test evaluation, 320-321 
True scores, 200 
estimates of, 212-216 
error in estimates of, 212-216 


Unit normal distribution, 99-100 
Unstable distributions, 146-150 
Urban, F. M., 327, 328 


Variability 
see Dispersion 


Wald, Elva, 72 

Walsh, C. M., 333 

Weighted average, best, 324-325 
Weighting, 67-68 

Weighting, effect of, 199-200 
Weightings, merit of fixt, 319-320 
Weldon, W. F. R., 189 

Whipple, G. M., 111 

Wood, Ben D., 97 

Woodyard, Ella, 309 


Yerkes, Robert M., 226, 228, 246, 250,
255, 290, 309
Yule, G. U., 160, 210, 259, 260, 261, 262




